phisymmetry.wordpress.com
Real-time Streaming ETL and Real-time BI – Part 2 – Technology Musings
https://phisymmetry.wordpress.com/2015/09/26/real-time-streaming-etl-and-real-time-bi-part-2
Just another WordPress.com site. Real-time Streaming ETL and Real-time BI – Part 2. Date: September 26, 2015. In Part-1 (published in 2012) we discussed limited options for streaming real-time ETL and Analytics on Hadoop. Lets explore how Kafka works with Storm / Spark / Samza. We briefly touch upon SQLStream and DataTorrents. Streaming technology has matured a lot and its now possible to develop a fault-tolerant , scalable messaging and streaming solution. Courtesy : O’reilly Data Newsletter. Apps and S...
dom.as
how innodb lost its advantage – domas mituzas
https://dom.as/2015/04/09/how-innodb-lost-its-advantage
How innodb lost its advantage. The new way is “ InnoDB Transparent PageIO Compression. 8221; – and it makes lots of sense from full-stack architecture perspective. It relies on the fact that high end flash storage devices already have a log-structured block storage internally, and if one ties directly into it, lots of overhead can be avoided (similar concepts are used by MariaDB’s atomic writes. Another problem is that buffer pool is no longer compressed. This may mean you will need to buy devices wi...
linkedbigdata.com
LinkedBigData: January 2015
http://www.linkedbigdata.com/2015_01_01_archive.html
Tuesday, January 27, 2015. Links: January 27, 2015. Quasar (Java library that provides high-performance lightweight thread, Go-like channel and Erlang-like actor). BTrace (Java application tracing tool without restart, use Java syntax, have many intended restrictions). RocksDB (embeddable low latency key-value store by Facebook, based on Google LevelDB, used by LinkedIn Samza). Tig (text-mode interface for Git, see also hub. Gitlet (implemention of Git in JavaScript). Reverse Engineering for Beginners.
blog.jassassin.com
HELLO SAMZA | Jassassin
http://blog.jassassin.com/2015/04/30/samza/hello-samza
分布式消息系统.而目前Apache Samza已经成为Apache基金会顶级项目 总体来说Samza服务架构由数据流层(Kafka),执行层(Yarn)以及处理层(Samza API)构成。 Jassassin # useradd -m -s /bin/bash -g root samza. Jassassin # passwd samza. Jassassin # vim /etc/sudoers. Jassassin # su - samza. Export JAVA HOME=/home/samza/jdk1.7". Export PATH= $JAVA HOME. Sudo apt-get install git. Sudo yum install git. Samza@jassassin $ git clone. Https:/ github.com/apache/samza-hello-samza.git hello-samza. Samza@jassassin /hello-samza $ ./bin/grid. 125;, "source". 123;is-talk:2,bytes-ad...
quora.com
Benjamin Darfler - Quora
https://www.quora.com/Benjamin-Darfler
This page may be out of date. Save your draft. Before refreshing this page. Submit any pending changes before refreshing this page. Distributed Systems Builder, NoSQL Tamer, Meditator, Father. Designing the next generation data platform at Localytics. So we can provide a world class mobile marketing experience. 281 Answers Most Recent. What unique ID schemes are horizontally scalable and naturally orderable? Distributed Systems Builder, NoSQL Tamer, Meditator, Father. Most Viewed Writer in. Views on Benj...
github.com
cockroach/README.md at master · cockroachdb/cockroach · GitHub
https://github.com/cockroachdb/cockroach/blob/master/README.md
Jul 27, 2016. Remove google groups and add forum. Users who have contributed to this file. 182 lines (125 sloc). A Scalable, Survivable, Strongly-Consistent SQL Database. CockroachDB is a distributed SQL database built on a transactional and strongly-consistent key-value store. It scales. Disk, machine, rack, and even datacenter failures with minimal latency disruption and no manual intervention; supports strongly-consistent. ACID transactions; and provides a familiar SQL. For more details, see our FAQ.
blog.jassassin.com
Page 2 | Jassassin
http://blog.jassassin.com/page/2
Nothing is impossible for a willing heart. 分布式消息系统.而目前Apache Samza已经成为Apache基金会顶级项目 总体来说Samza服务架构由数据流层(Kafka),执行层(Yarn)以及处理层(Samza API)构成。 但是安装相当麻烦.最近偶然将hexo升级到了3.0版本,结果出现了很多问题.更悲催的是 chenall. 目前还不能兼容3.0版本.使用 chenall. 主题也有相当一段时间了,也想换换感觉.于是在Github上 hexo themes. 2016 Jassassin with help from Hexo.
jinrongxinxi.blogspot.com
金融: 六月 2015
http://jinrongxinxi.blogspot.com/2015_06_01_archive.html
来源:http:/ zhuanlan.zhihu.com/donglaoshi/20009819. 这里面创业公司太多了,包含提供商务数据分析,可视化报表,大数据平台,数据存储,挖掘应用等,我就简单说一些我感兴趣的,它们大多在硅谷,其他的可以参考。更新到2015年6月8日,92家。 65306;融资:9.5亿美元。150亿美金估值,已经是超级独角兽单独列出来。Peter Thiel创办大数据公司。数据集成、 信息管理和定量的分析。连接到商业、 专有和公共数据集,并发现趋势、 关系和异常,包括预测分析。 65306;高效、大容量的图形数据库和分析平台,创始人是国人。 融资:3.11亿美元。细分行业:面向文档数据库采集。它灵活的存储方式非常受青睐。 65306;融资:1.9亿美元。细分行业:基于Apache Cassandra的数据库支持平台。客户包括eBay、Adobe、Netflix等. Open-source, scalable database that makes building realtime apps dramatically easier. 65306;可靠的、可伸缩的&#...
SOCIAL ENGAGEMENT