Search notes:

Hadoop MapReduce

Hadoop MapReduce is a framework for processing large datasets in parallel using a cluster of lots of computers.
It behaves as if were one big computer.
Hadoop's MapReduce is a disk based two-stage paradigm. Apache Spark, in contrast, uses an in-memory primitive.

Architcture

batch-based: it is therefore weak when dealing with real-time data.
shared nothing: might be problematic for some algorithms

See also

MapReduce

Index

Fatal error: Uncaught PDOException: SQLSTATE[HY000]: General error: 8 attempt to write a readonly database in /home/httpd/vhosts/renenyffenegger.ch/php/web-request-database.php:78 Stack trace: #0 /home/httpd/vhosts/renenyffenegger.ch/php/web-request-database.php(78): PDOStatement->execute(Array) #1 /home/httpd/vhosts/renenyffenegger.ch/php/web-request-database.php(30): insert_webrequest_('/notes/developm...', 1759398440, '216.73.216.42', 'Mozilla/5.0 App...', NULL) #2 /home/httpd/vhosts/renenyffenegger.ch/httpsdocs/notes/development/Apache/Hadoop/MapReduce/index(50): insert_webrequest() #3 {main} thrown in /home/httpd/vhosts/renenyffenegger.ch/php/web-request-database.php on line 78