Hadoop:
- Open source framework for reliable, distributed computing
Two Core Components:
- HDFS: distributed, replicated file system; self-healing, high-bandwidth clustered storage; just stores bytes
- MapReduce: API for parallel computing; fault-tolerant distributed processing; a batch system
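The MapReduce model can be sketched in plain Python to show the three stages (map, shuffle, reduce). This is only a single-process illustration, not Hadoop's API: the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical, and a real job would run the stages in parallel across a cluster with input read from HDFS.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three stages in sequence on an in-memory list of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["hadoop stores bytes", "hadoop runs batch jobs"]))
```

Because each call to `map_phase` looks at one line only, and each call to `reduce_phase` looks at one key only, the work can be split across many machines, which is what makes the model suitable for batch processing.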
Feature:
- Hadoop is designed to scale linearly as data size or analysis complexity grows
Not NoSQL:
- Hive project adds SQL support to Hadoop
- HiveQL compiles to a query plan
- Query plan executes as MapReduce jobs
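As a rough illustration of how a SQL-style query can be expressed as map and reduce stages, the sketch below emulates a query of the form `SELECT page, COUNT(*) FROM page_views GROUP BY page`. It is not Hive's planner: the table `PAGE_VIEWS`, its columns, and the function `run_group_by_count` are all assumptions made up for this example.

```python
from collections import defaultdict

# Hypothetical rows standing in for a table whose files live in HDFS.
PAGE_VIEWS = [
    {"user": "ann", "page": "/home"},
    {"user": "bob", "page": "/home"},
    {"user": "ann", "page": "/docs"},
]


def run_group_by_count(rows, column):
    """Emulate SELECT <column>, COUNT(*) ... GROUP BY <column>."""
    # Map stage: use the grouping column as the key and emit a count of 1.
    mapped = [(row[column], 1) for row in rows]

    # Shuffle stage: bring all values with the same key together.
    grouped = defaultdict(list)
    for key, value in mapped:
        grouped[key].append(value)

    # Reduce stage: aggregate the values for each key.
    return {key: sum(values) for key, values in grouped.items()}


if __name__ == "__main__":
    print(run_group_by_count(PAGE_VIEWS, "page"))
```

The GROUP BY column becomes the map output key and the aggregate function becomes the reducer, which is the general shape a simple aggregation query takes when run as a MapReduce job.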
Hadoop users
Other terms:
- ZooKeeper – distributed synchronization
- Avro – data serialization / RPC
- HBase – structured, distributed database built on a horizontally scalable file system (HDFS)
Hadoop ecosystem

Useful links:
http://hadoop.apache.org/common/
http://www.cloudera.com/
http://www.slideshare.net/cloudera/tokyo-nosqlslidesonly
http://www.slideshare.net/xefyr/introduction-to-hadoop-hbase-and-nosql
http://www.slideshare.net/adorepump/hbase-nosql