tom-e-white.com
Tom White: January 2015
http://www.tom-e-white.com/2015_01_01_archive.html
Problems worthy of attack prove their worth by hitting back. —Piet Hein. Friday, 16 January 2015. Some of the largest datasets are generated by the sciences. For example, the Large Hadron Collider produces around 30PB of data a year. I'm interested in the technologies and tools for analyzing these kind of datasets, and how they work with Hadoop, so here's a brief post. Amazon S3 seems to be emerging as the de facto. Hosts a 200TB dataset on S3. Notebooks have been around in the scientific community for a...
tom-e-white.com
Tom White: Hadoop for Science
http://www.tom-e-white.com/2015/01/hadoop-for-science.html
Problems worthy of attack prove their worth by hitting back. —Piet Hein. Friday, 16 January 2015. Some of the largest datasets are generated by the sciences. For example, the Large Hadron Collider produces around 30PB of data a year. I'm interested in the technologies and tools for analyzing these kind of datasets, and how they work with Hadoop, so here's a brief post. Amazon S3 seems to be emerging as the de facto. Hosts a 200TB dataset on S3. Notebooks have been around in the scientific community for a...
onstrategies.com
Spark Summit debrief: Relax, the growing pains are mundane | OnStrategies Perspectives
http://www.onstrategies.com/blog/2015/03/22/spark-summit-debrief-relax-the-growing-pains-are-mundane
Spark Summit debrief: Relax, the growing pains are mundane. March 22, 2015. As the most active project (by number of committers) in the Apache Hadoop open source community, it’s not surprising that Spark. Has drawn much excitement and expectation. At the core, there are several key elements to Spark’s appeal:. Is doing with the SNAP framework. To differentiate its proprietary Aster platform. Among others, has termed Spark. So we were quite pleased to see Spark Summit making it to New York. Whose founders...