Certificate in Advanced Big Data and Data Analytics (CABDDA)

Course DMBI-011

Certificate in Advanced Big Data and Data Analytics (CABDDA)

Big data is a change agent that challenges the ways in which organizational leaders have traditionally made decisions. The outline covers Big Data Analytics Use Cases, Storing Big Data, Computing Big Data, and Big Data A...

Classroom

8 sessions
18 - 22 May 2026 Barcelona €2,695 Register
22 - 26 June 2026 Frankfurt €2,275 Register
17 - 21 August 2026 Kuala lumpur €1,575 Register
21 - 25 September 2026 Barcelona €2,695 Register
19 - 23 October 2026 London €2,940 Register
2 - 6 November 2026 Munich €2,415 Register
21 - 25 December 2026 Amsterdam €2,975 Register
Scroll for more

Online / Live

8 sessions
Scroll for more

Introduction

Course overview

Why Attend

Big data is a change agent that challenges the ways in which organizational leaders have traditionally made decisions. This course provides participants with the confidence to store, process, analyze and present big data use cases within their organizations. This course provides a multitude of hands-on labs with Spark, a key big data technology used to solve data intensive problems. Participants will gain the knowledge and skills they need to assemble and manage a large-scale big data analytics project. Lastly, participants will work through advanced machine learning and deep learning use cases.

This is our most advanced course in our big data series following Certified Big Data and Data Analytics Practitioner (CBDDAP) and Certificate in Big Data Fundamentals (CBDF). Participants will aim to identify areas within their organization that can be improved through big data use cases, and work on an individual chosen data project during the course. By the end of the course, participants will be able to work through multiple methods and practical approaches to leverage Spark for advanced big data analytics.

Course Methodology

This course will be highly technical with group discussions, hands-on practical exercises, and group activities being the core focus.

Course Objectives

By the end of the course, participants will be able to:

  • Understand key big data technologies, including a deep dive into Apache Spark
  • Describe the main challenges and advantages of Hadoop map-reduce
  • Demonstrate and discuss key technologies for big data storage and compute, such as PostgreSQL and object storage
  • Discuss popular machine learning algorithms, deep learning techniques and the importance of ethics in data analytics and artificial intelligence
  • Deliver a presentation demonstrating the analytics lifecycle and Spark
Target Audience

This is an advanced level course. It is expected that participants either have a number of years of experience utilizing big data, or have previously attended the Certified Big Data and Data Analytics Practitioner (CBDDAP) course. This course is ideal for data engineers, AI engineers and data scientists. Recommended pre-knowledge includes some python programming experience and data visualization practice.  

Target Competencies
  • Big data utilization
  • Big data analytics structures and technologies
  • Ethics and integrity for big data and AI development
  • Big data storage
  • Apache Spark best practices

What you will achieve

Learning objectives

  • Understand key big data technologies, including a deep dive into Apache Spark
  • Describe the main challenges and advantages of Hadoop map-reduce
  • Demonstrate and discuss key technologies for big data storage and compute, such as PostgreSQL and object storage
  • Discuss popular machine learning algorithms, deep learning techniques and the importance of ethics in data analytics and artificial intelligence
  • Deliver a presentation demonstrating the analytics lifecycle and Spark

Who should attend

Target audience

  • This is an advanced level course. It is expected that participants either have a number of years of experience utilizing big data, or have previously attended the Certified Big Data and Data Analytics Practitioner (CBDDAP) course. This course is ideal for data engineers, AI engineers and data scientists. Recommended pre-knowledge includes some python programming experience and data visualization practice.
  • Target Competencies
  • Big data utilization
  • Big data analytics structures and technologies
  • Ethics and integrity for big data and AI development
  • Big data storage
  • Apache Spark best practices

Methodology

Learning approach

  • This course will be highly technical with group discussions, hands-on practical exercises, and group activities being the core focus.

Course content

Course outline and key learning areas

Module 1

Big Data Analytics Use Cases

  • How can big data projects meet organizational needs
  • Big data examples:
  • Netflix
  • LinkedIn
  • Facebook
  • Google
  • Orbitz
  • Dell
  • Others
  • Best practices in project design

Module 2

Storing Big Data

  • Big data architectures and paradigms
  • The Hadoop Ecosystem
  • Overview of Hadoop
  • Hadoop Distributed File System (HDFS)
  • Massively parallel processing (MPP) versus distributed in-memory applications
  • RDBMSs vs NoSQL DBs
  • PostgreSQL
  • MongoDB
  • Cassandra
  • Streaming data

Module 3

Computing Big Data

  • How to access big data
  • Role of cloud computing
  • Data movement risk
  • Networking and co-location
  • Apache Spark lab
  • Big data extract, transform, load (ETL) big data compute technologies
  • Distributed compute
  • High performance clusters vs Apache Spark
  • Streaming: Storm, Spark structured streaming
  • Apache Spark ETL labs

Module 4

Big Data Advanced Analytics and AI

  • Analytics Lifecycle
  • Apache Spark vs Pandas
  • Big data machine learning & deep learning in Spark
  • Importance of ethics in AI
  • Automl & Hyperparameter tuning

Module 5

Course Big Data Projects

  • Identify analytical opportunities in an organization
  • Define and assess the problem
  • Describe the impact and use of data to address the problem
  • Identify potential data sources
  • Design a data analytics project
  • Access, explore, analyze and visualize chosen dataset for project
  • Present project insights in course

FAQ

Frequently asked questions

What does Certificate in Advanced Big Data and Data Analytics (CABDDA) (DMBI-011) cover?

This course covers Data Management and Business Intelligence through a structured five-day outline focused on practical application, discussion, and implementation planning.

When is the next available session?

The next scheduled session starts on 18 - 22 May 2026, with additional classroom dates and mirrored Online / Live options listed in the course schedules section.

Who should attend this course?

This is an advanced level course. It is expected that participants either have a number of years of experience utilizing big data, or have previously attended the Certified Big Data and Data Analytics Practitioner (CBDDAP) course. This course is ideal for data engineers, AI engineers and data scientists. Recommended pre-knowledge includes some python programming experience and data visualization practice., Target Competencies, Big data utilization

How can I register for a session?

Use any Register button next to the available course dates to open the participant registration page and submit your booking request for the selected session.

Is this course available online as well as classroom-based?

Yes. The course detail page includes both classroom sessions and Online / Live sessions, with online options aligned to the same course dates for easier planning.

Where are classroom sessions delivered?

Current classroom venues include Barcelona, Frankfurt, Rome, Kuala lumpur, London, Munich, Amsterdam.

Still Have Questions?

Contact the academy team for course details, delivery options, and delegate guidance.

Contact Us