Pros and cons for NoSQL

Pros: a tornado of razorblades: SQL Databases Don’t Scale (Hacker News thread) Cons: Code Monkeyism: The dark side of NoSQL (Hacker News thread) Related: Archives of the Caml mailing list: Message from Brian Hurt Chris Williams , Co-Curator of NoSQL East, NoSQL: A Modest Proposal Carsonified: Should you go Beyond Relational Databases?

Continue reading →

The UNIX Way

Kas Thomas of CMS Watch riffs on “The UNIX Way”, principals summarized by Mike Gancarz: 1. Small is beautiful. 2. Make each program do one thing well. 3. Build a prototype as soon as possible. 4. Choose portability over efficiency. 5. Store data in flat text files. 6. Use software leverage to your advantage. 7. […]

Continue reading →

Reading up on ETL (Extract, Transform, Load) processing

Wikipedia: Extract, transform, load Wikipedia: Talend Open Studio Talend Open Studio: Tutorials Manageability: Open Source ETL (Extraction, Transform, Load) Written in Java richard.gluga.com: Data Migration Done Right kJube: Vendors and tools – ETL AlfrescoForge: ETL Connector Talend job for Job Scheduler implement High Scalability: How Rackspace Now Uses MapReduce and Hadoop to Query Terabytes of […]

Continue reading →

Smart aggregation and API use in NPRbackstory

NPRbackstory is an automated Twitter feed that attempts to add context to the news stories trending popular today according to Google’s Hot Trends. It leverages NPR’s archives (very smart, as Joshua Benton notes archives are underused assets), and Yahoo! Pipes to produce a RSS feed that is fed into the NPRbackstory account. It was developed […]

Continue reading →