
Big Data Boot Camp

$1,395.00 – $1,795.00


 

Classroom, Live Online, Custom Sessions (14 PDUs)


FREE Items

Select a free item included with your class.

Class Description

An introduction to Big Data with an emphasis on Apache Hadoop.

This course provides a technical overview of Apache Hadoop for project managers, business managers, and data analysts. Students will come to understand the overall big data space and the technologies involved, and will get a detailed overview of Apache Hadoop. The course uses real-world use cases to demonstrate the capabilities of Apache Hadoop. Students will also learn about YARN and HDFS, and how to develop applications and analyze Big Data stored in Apache Hadoop using Apache Pig and Apache Hive. Each topic includes hands-on exercises.

The course is developed and taught by certified Hadoop consultants who have a passion for teaching and who help clients get value from Big Data and Hadoop technologies on a daily basis.

Features of this Class:

  • Led by experienced Big Data and Hadoop consultants
  • Hands-on activities that immerse you in the capabilities of the Hadoop ecosystem
  • Assumes no prior background in big data
  • Material designed by a computer science professor using pedagogical techniques that make the content easier to understand
  • Provides both foundational and practical knowledge essential for successful big data ventures
  • Fast-paced introduction to get you up to speed with Big Data
  • Includes materials covered in most Hadoop certification exams
  • Labs with practice sessions on a Hadoop cluster
  • Includes insights learned from experience on real projects
  • Covers things to do and things not to do in big data projects

20 Immediate Benefits of Participating in this Workshop:

  1. Learn about the big data ecosystem
  2. Understand the benefits and ROI you can get from your existing data
  3. Learn about Hadoop and how it is transforming the workplace
  4. Learn about MapReduce and the Hadoop Distributed File System (HDFS)
  5. Learn about using Hadoop to identify new business opportunities
  6. Learn about using Hadoop to improve data management processes
  7. Learn about using Hadoop to clarify results
  8. Learn about using Hadoop to expand your data sources
  9. Learn about scaling your current workflow to handle more users and lower your overall performance cost
  10. Learn about the various technologies that comprise the Hadoop ecosystem
  11. Learn how to write a simple MapReduce job in Java or your favorite programming language
  12. Learn how to use a very simple scripting language to transform your data
  13. Learn how to use a SQL-like declarative language to analyze large quantities of data
  14. Learn how to connect your existing data warehouse to the Hadoop ecosystem
  15. Learn how to move your data to the Hadoop ecosystem
  16. Learn how to move the results of your data analysis to Business Intelligence tools like Tableau
  17. Learn how to automate your workflow using Oozie
  18. Learn about polyglot persistence and identifying the right tool for the right job
  19. Learn about future trends in big data and technologies to keep an eye on
  20. Discover tips and tricks behind successful Hadoop deployments

Course Outline

1. Introduction to Big Data

•Big Data – a major business & technology trend in enterprise computing
•Exponentially increasing data from ERP data to CRM data to Web data to Big Data
•Big data sources – sensors, social, geospatial, video, others
•Data warehousing, business intelligence, analytics, predictive statistics, data science

2. Survey of Big Data technologies

•First generation RDBMS, ETL and BI systems
•Second generation systems – columnar databases with compression and MPP architectures, data warehousing appliances
•Streaming processing, statistical processing, data visualization
•Enterprise search
•NoSQL databases
•How do technologies like MongoDB, MarkLogic, and CouchDB fit in?
•What is polyglot persistence?
•Apache Hadoop

3. Introduction to Hadoop

•What is Hadoop? Who are the major vendors?
•A dive into the Hadoop Ecosystem
•Benefits of using Hadoop
•How to use Hadoop within your infrastructure?
•Where do we use Hadoop?
•Where do we look at options besides Hadoop?

4. Introduction to MapReduce

•What is MapReduce?
•Why do you need MapReduce?
•Lab: How to use MapReduce in Hadoop?
•How does it work from languages like Java?
•How does it work with languages like Ruby?

5. Introduction to YARN

•What is YARN?
•What are the advantages of using YARN over classical MapReduce?
•Lab: How to use YARN within Hadoop?
•How does it work from languages like Java?
•How does it work with languages like Ruby?

6. Introduction to HDFS

•What is HDFS?
•Why do you need a distributed file system?
•How is a distributed file system different from a traditional file system?
•What is unique about HDFS when compared to other file systems?
•Is HDFS reliable?
•Does it offer support for compression, checksums and data integrity?
•Lab: Overview of HDFS commands
•Standard file system commands
•Moving data to and from HDFS

7. Data Transformation

•Why do you need to transform data?
•What is Pig?
•Use cases for Pig
•Lab: Hands-on activities with Pig
•Joining Data
•Filtering Data
•Storing and Loading Data

8. Structured Data Analysis

•How do you handle structured data with Hadoop?
•What is Hive/HCatalog?
•Use cases for Hive/HCatalog
•Lab: Hands-on activities with Hive/HCatalog
•Storing and Loading Data
•Select expressions
•Hive vs SQL

9. Loading data into Hadoop

•How do you move your existing data into Hadoop?
•What is Sqoop?
•Lab: Hands-on activities with Sqoop
•Running evaluation commands with Sqoop
•Importing data from relational databases
•Exporting data to relational databases

10. Automating workflows in Hadoop

•Benefits of Automation
•What is Oozie?
•Lab: Demonstration of Oozie
•Creating a workflow
•Running a workflow automatically at regular intervals
•Running a workflow automatically when some events are triggered

Who Should Attend?

Anybody who works with databases or data analysis, or who is wondering how to deal with mountains of data (anywhere from gigabytes of user or log data to petabytes), will benefit from this program. This course is perfect for:

•Business Analysts
•Software Engineers
•Project Managers
•Data Analysts
•Business Customers
•Team Leaders
•System Analysts

Additional Info

Class Length

3 Days

Class Locations

Atlanta-Sandy Springs-Gainesville, GA-AL; Live Virtual Class (Attend from Anywhere); Miami-Fort Lauderdale, FL; Washington-Baltimore-Northern Virginia, DC-MD-VA-WV

Class Dates

Dec 14, 2015 thru Dec 15, 2015; Feb 22, 2016 thru Feb 23, 2016; Feb 29, 2016 thru Mar 02, 2016; Jan 21, 2016 thru Jan 22, 2016; Jan 25, 2016 thru Jan 27, 2016

Guarantee & Policies

Course registration info, our commitment to your privacy, and general terms and conditions

 

Course registration information:

All courses carry Project Consults’ Guarantee of 100% Satisfaction:
Project Consults provides an unsurpassed training experience. If for any reason you are not satisfied with the program, simply notify the instructor or registrar of your intent to withdraw from the program prior to the first morning break, turn in your course materials and receive a 100% refund. If at the end of the program day, you are unsatisfied with the program, we will credit your tuition towards a future program of your choice.

Payment Policy:
Payment is required at time of registration. Approved forms of payment include a company purchase order, PayPal, or credit card. We accept Visa, MasterCard, American Express, and Discover.

Courses are available as onsite training: 
All courses are available as onsite training at your location. On-site options can be very cost effective.

Course Hours:
This course begins promptly at 8:30 AM and ends at 4:30 PM, unless otherwise noted on the course page or in email notifications. Please arrive at 8:00 AM on the first morning of class to sign in and meet your fellow attendees.

Shipping of Course Materials:
In an effort to reduce paper waste, course materials for live-online sessions will be digital. You will receive information on how to obtain your course materials in your confirmation email.

Substitution & Cancellation Policy: 
If a change needs to be made to your public course registration (cancellation, transfer, or substitution), Project Consults must receive written notice via email at sales@projectconsults.com. If a cancellation or transfer request is made less than 15 business days prior to the class start date, payment will still be due, no refunds will be issued, and you will be charged a $200 change fee. Your paid tuition will be available for one year to be used as a credit towards another course of equal value; only one reenrollment opportunity is allowed. Failure to attend the course without written notification will result in forfeiture of the full course price. Student substitutions may be made at any time prior to the start of class free of charge. If a student substitution is made for a live, online session and any hard-copy materials have been provided to the initial student, it is the responsibility of the client to pass along those materials to the new attendee. If Project Consults is forced to cancel a course for any reason, liability is limited to the registration fee only. If you have questions or concerns, please contact sales@projectconsults.com or call 469-424-1084.

In certain situations, Project Consults may not have the required enrollment to hold a course as scheduled. We do our best to confirm every class, but our main mission is to provide students with the skills and knowledge to have a positive impact on their work performance. For this reason, if a class you are enrolled in is cancelled, Project Consults will automatically enroll you in the next available live, online session of the same course. You will be notified during this process and can work with a Project Consults representative on alternate options if you are unable to attend the new session.

Substitution & Cancellation Policy (PMP Boot Camp): 
If you are unable to attend your scheduled training class, please contact us directly at 469-424-1084. We require 16 calendar days' notice to reschedule or to cancel any registration (and receive a refund). Failure to provide the required notification will result in a 100% charge of the course fee. If a student does not attend a scheduled course without prior notification, or contacts us to cancel within the notification window, the student will have the option to pay a $200 reschedule fee to attend one of the live, online sessions of the PMP Boot Camp. Within the notification period, only student substitutions will be permitted.

Hotel Reservations:
Project Consults does not set aside a block of rooms for class participants. If you wish to book a sleeping room, please contact Project Consults for the best hotel options or recommendations. For directions to the course location, please call Project Consults or contact the training center or hotel directly.
