if you are still working on Hadoop, you might be outdated.
Check out what Google is working on – http://tinyurl.com/7usxren
under the hood is strong doses of caffeine that literally wakes up and indexes on the fly – http://tinyurl.com/7gna225. Per Google, Caffeine is a fundamental re-architecting of how the indexing system works