Category Archives: Implementing Technology

Big Data

My Notes

  1. Hadoop
    1. Hadoop local vs “in the cloud” (i.e., EC2)
      1. Need EC2 CLI (on local host) – http://aws.amazon.com/developertools/351
  2. Hive vs Pig vs HBase
  3. MapReduce
Books
  • Hadoop in Action by Chuck Lam (Mannin)
  • Hadoop Operations by Eric Sammer (O’Reilly)
  • Programming Hive by Capriolo, Wampler, Rutherglen (O’Reilly)

– manzoor

DB Programming

DB not limited to MySQL and Postgress but includes Redis, MongoDB, elasticsearch etc.

The goal is to find the lowest possible tool – i.e., scheel commands would be the lowest and dedicated DB tool would be the highest. Python, Ruby, PHP would be on the lower end of the middle and Java would be on the higher.

– manzoor

Testing a Quick Connection to Facebook, Twitter

I need to be able to test a connection to Twitter from various systems to troubleshoot an connect issue from a particular instance. A very simple CURL command would be perfect, but if I try to get anything more than just a page (of a user I already know), it seems I need to authenticate. To make things more complicated, basic authentication has been discontinued since the introduction of OAUTH.

– manzoor