• If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • You already know Dokkio is an AI-powered assistant to organize & manage your digital files & messages. Very soon, Dokkio will support Outlook as well as One Drive. Check it out today!

View
 

hadoophackdaydelhi

This version was saved 14 years, 2 months ago View current version     Page history
Saved by jboutelle
on January 17, 2010 at 9:44:54 am
 

 

 

-------------

 

HadoopHackDay Winners

First Prize : Sri Prasanna & Anshuman Nangia (from SlideShare)......   their hack was titled "Adsortion based slideshows suggesstion and discovery"

Outside Prize : Sanjay Sharma/Chandra Prakash Bhagtani (from Impetus) ... or their hack about using Hadoop for building decision making engine for Business Analysts

Congrats prize winners ! Thanks everyone for participating... this was an awesome show!

 

-------------

<div style="width:425px;text-align:left" id="__ss_2935493"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/AmitRanjan/hadoop-hackday-at-the-slideshare-office" title="Hadoop Hackday at the SlideShare office">Hadoop Hackday at the SlideShare office</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=hackday-100117113304-phpapp02&stripped_title=hadoop-hackday-at-the-slideshare-office" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=hackday-100117113304-phpapp02&stripped_title=hadoop-hackday-at-the-slideshare-office" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/AmitRanjan">Amit Ranjan</a>.</div></div>

 

What is HadoopHackDay?

 

Hadoop Hackday Delhi is a fun, interactive, collaborative and semi-competitive event centered around Hadoop hacking. The goal is to experiment, learn Hadoop, and build something useful, imaginative and working using Hadoop and related distributed computing technologies.  The event is open to all.

 

For those who don't know, Hadoop is an open-source implementation of the map-reduce design pattern pioneered by google. It allows you to chop a big problem (like, ehem, crawling and indexing the entire internet) into a lot of little problems.

 

If you haven't used hadoop before that is perfectly fine. Most of the participants have never used hadoop before, so you won't be at a disadvantage. Many participants plan on using Amazon Elastic MapReduce, and writing programs in Pig or Hive. So this will be an excellent opportunity to learn how to get started using hadoop with minimal operations overhead in a supportive (yet semi-competitive) environment. By the end of the weekend you should be up and running with Hadoop, and able to use it in your personal or professional life with confidence!

 

When

Saturday, 9th Jan 2010 and Sunday, 10th Jan 2010

 

Where

SlideShare New Delhi office (221, First Floor,, Okhla Phase III,New Delhi-110020)

Directions : Come to the Modi Mill Compound in Okhla Phase III and ask for the Mercedes Benz showroom; our office is two buildings away from the showroom.

When you enter our building ask for Uzanto on the first floor (thats the name of our company... most people do not know the name SlideShare :)

 

Agenda

Day 1 (9th Jan 2010)

  • Start at 10.30AM
  • 15 mins talk by Jonathan Boutelle
  • Hackday preparations 11AM - 12PM
  • Hack starts at 12PM
  • Lunch: 1PM - 2PM
  • Dinner: 8PM - 9PM

 

Day 2 (10th Jan 2010)

  • Breakfast: 9AM - 9.30AM
  • 1 hour left warning at 11AM
  • Hack ends at 12PM
  • Lunch: 1PM - 2PM
  • Setup for demoes: 2.30PM - 3.30PM
  • Working demoes - 15 mins max for each one: 3.30PM - 5PM
  • Results & Prize distribution: 6PM

 

 

Hacks and Teams

List your hacks and team members (maximum 2) below. 1 team can submit as many hacks as they want. Prizes will be given to winning teams.

(please leave names of all team members, plus an email address or domain. space is somewhat limited but we hope to be able to accomodate all teams).

 

1. Mani and Aadhar (SlideShare)

 

Hack: Hadoop with Pig for the classfication of unclassified slideshows based on the training from classified ones.

Hack: Hadoop with Pig to parse slideshare haproxy logs and get a list of search engine keywords that have been used to get to the site, and their occurances.

Hack: Hadoop with Hive to analyze information from the haproxy logs.

 

2. Prasanna and Anshuman (SlideShare)

 

Hack :  Adsortion based slideshows suggesstion and discovery

 

3. Garima and Siddhant (SlideShare)

Hack : Suggesting presentations to (slideshare) users based on different metadata available

 

4. Chetan Sachdev (email: cksachdev[at]gmail[dot]com )

 

Hack : Using Hadoop for making decision making engine for Business Analysts

 

5. Sanjay Sharma/Chandra Prakash Bhagtani (e: sanjay[dot]sharma[at]impetus[dot]co[dot]in  / chandrap[dot]bhagtani[at]impetus[dot]co[dot]in )

 

Hack: Using Hadoop for making decision making engine for Business Analysts

 

6. Lalit Shandilya, shanlalit at gmail.com

7. Siddharth Mitra (sidmitra DOT del AT gmail DOT com)

 

8. Team name: Decepticons

Team members: Kapil and Giri (SlideShare) (kapil AT slideshare DOT com, giri DOT gaurav AT slideshare DOT com)

Hack: Distributed Chores - using hadoop and pig to parse slideshare haproxy logs and answer interesting questions about performance, traffic trends etc

 

9. Shashwat Anand (anand[dot]shashwat[at]gmail[dot]com)

Hack: Parsing Indian trade mark journals for creating an ordered dataset with Amazon Elastic MapReduce, Python and Hadoop Streaming

 

10. Team name: DarkSiders

Team Members:Sohil Gupta and Nitin Goyal(sohilgupta AT gmail DOT com AND nitin2goyal AT gmail DOT com)

HACK: Page Ranker - parsing referrer contained in slideshare access logs and take note of page ranks of slideshare pages for those search terms.

Judges

Jon Boutelle (CoFounder & CTO, SlideShare) and Amit Ranjan (CoFounder & COO, SlideShare)

 

 

Prizes

 

a) First prize (anyone can win this, whether they work for slideshare or not) is an Ipod Touch

b) Outsiders prize (only non-slideshare employees can win) is an Ipod Shuffle

If two people are on a team, they will each get a prize.

 

 

 

Useful Links

http://hadoop.apache.org

http://hadoop.apache.org/pig/ 

http://www.cloudera.com/

http://s3.amazonaws.com/awsVideos/AmazonElasticMapReduce/ElasticMapReduce-PigTutorial.html

 

 

Connect

IRC: Join #hadoophackday room at chat.slideshare.net:46664

Mailing List: hadoophackday@slideshare.com (send email to kapil@slideshare.com to subscribe or unsubscribe)

Twitter List: http://twitter.com/slideshare/hadoophackday

Twitter Hashtag: #hadoophackday