Random Sampling

May 21, 2012

Random sampling of data is critical to confirm accuracy of search results in E-Discovery applications.  Following article is a good description of the math behind sample calculations.  Prevalence of relevant documents is an important aspect of the calculations

http://tinyurl.com/cm8yals


Hadoop/Hbase vs RDBMS

March 27, 2012

good presentation comparing Hadoop/Hbase vs RDBMS

http://goo.gl/AakTO