Tag Archives: ETL

Storm: the Hadoop of Realtime Stream Processing

This presentation is a great way to get a peek at what Twitter’s Storm is about:
YouTube: PyCon US 2012: Gabriel Grant:
Related:
Twitter Engineering: “A Storm is coming: more details and plans for release”
GitHub: Storm

Posted in Coding, Software Engineering, Programming

Data Journalism and Visualization with an Example

Guardian: Paul Bradshaw: “How to be a data journalist”
ProPublica: Jeff Larson: “The Rainbow Connection: How We Made Our CDO Connections Graphic” (tools mentioned: google-refine (formerly Gridworks), Raphaël, JSON)

Posted in Coding, Software Engineering, Programming, Journalism, norgs, and the future of news

A Yahoo! Pipes influenced Python toolset for ETL

PyF looks interesting. In a similar vein are Ruffus and Orange (Orange looks impressive and has data analysis capability to boot).

Posted in Coding, Software Engineering, Programming

What is ETL and CMS?

You’re a programmer with a task to retrieve information from some source, manipulate and massage it, and deploy it somewhere. Like all things in programming, there is an acronym for that: “ETL”. ETL stands for Extract, Transform, and Load. … Continue reading
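As a rough illustration of the three steps named above, here is a minimal Python sketch. It is not taken from the original post: the sample CSV data, the `payments` table, and the function names `extract`, `transform`, `load`, and `run_etl` are all made up for the example, and an in-memory SQLite database stands in for "somewhere".

```python
import csv
import io
import sqlite3

# Hypothetical sample data standing in for "some source".
RAW_CSV = "name,amount\n alice ,10\nBOB,25\n"


def extract(text):
    """Extract: read rows from CSV text into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: tidy up names and convert amounts to integers."""
    return [(row["name"].strip().title(), int(row["amount"])) for row in rows]


def load(records, conn):
    """Load: write the cleaned records into a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS payments (name TEXT, amount INTEGER)"
    )
    conn.executemany("INSERT INTO payments VALUES (?, ?)", records)
    conn.commit()


def run_etl(text, conn):
    """Chain the three steps: extract, then transform, then load."""
    load(transform(extract(text)), conn)


# An in-memory database is an assumption made to keep the sketch self-contained.
conn = sqlite3.connect(":memory:")
run_etl(RAW_CSV, conn)
```

Keeping each step as its own function means the source, the clean-up rules, or the destination can be swapped without touching the other two.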

Posted in Coding, Software Engineering, Programming

NoSQL, Relational Database, ETL Link-a-rama for November 25th, 2009

Jon Moore: NoSQL East 2009 Redux
Dare Obasanjo: Building Scalable Databases: Perspectives on the War on Soft Deletes
Explain Extended: What is a relational database?
Explain Extended: What is the entity-relationship model?
Data Doghouse: Data Integration: Hand-coding Using ETL Tools … Continue reading

Posted in Coding, Software Engineering, Programming

Hive, Hadoop at Facebook, Yahoo

Engineering@Facebook: Hive – A Petabyte Scale Data Warehouse using Hadoop
Yahoo! Developer Blog: Announcing the Yahoo! Distribution of Hadoop

Posted in Coding, Software Engineering, Programming

Reading up on ETL (Extract, Transform, Load) processing

Wikipedia: Extract, transform, load
Wikipedia: Talend Open Studio
Talend Open Studio: Tutorials
Manageability: Open Source ETL (Extraction, Transform, Load) Written in Java
richard.gluga.com: Data Migration Done Right
kJube: Vendors and tools – ETL
AlfrescoForge: ETL Connector
Talend job for Job … Continue reading

Posted in Coding, Software Engineering, Programming