Cloudera Hadoop Developer Certification(CCD-410)

I have cleared Cloudera certification in May 2015 . I would like to share how to work towards preparing yourself for the cloudera certification –

  • Hadoop Definitive Guide(3rd Edition/4th Edition)
  • Good Training : way2learnonline.com
  • HadoopExam “Simulator” from http://www.hadoopexam.com will guide you very well
  • https://developer.yahoo.com/hadoop/tutorial/
  • There will be some questions from sqoop as well.Check various ways of Import/Export options.
  • 1-2 questions from Hive/Pig/Flume/Oozie.
  • Go Thru http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/FileSystemShell.html
  • Functionality of classic map reduce and yarn daemons should be known
  • Most of the questions will be Map-Reduce related and some questions will be exercises of Map-Reduce.
  • Have a look at regular expressions as some questions may include regular expressions
  • Check Streaming,JVM Reuse,Speculative Execution,Skipping Bad Records
  • In MapReduce , have a look at following –
  • MapSide Join(Replicated Join)
  • Composite Join
  • Reducer side join(Repartition join)
  • Check Secondary sort,Inverted Index,Total Order Partitioner,NGRAM
  • Use various InputFormats(SequenceFileInputFormat,TextInputFormat ,NLineInputFormat etc), various OutputFormats
  • MultipleInputs
  • MultipleOutputs – http://www.lichun.cc/blog/2013/11/how-to-use-hadoop-multipleoutputs/
  • CustomInputFormat,RecordReader etc –
  • Combiner
  • Custom Writable,WritableComaparable,RawComparator
  • ControlledJobs
  • CustomPartitioner
  • MRUnit
  • Do some basic exercises in MapReduce for practice
  • Write MR code equivalent to queries like –
    • select field1,field2 from table
    • select distinct field1,field2 from table
    • select field1,count(*) from table group by field1
    • select field1,field2 from table order by field1,field2
    • …you can think of many more
  • ALL THE BEST ….
Advertisements