Apache Cassandra Training
The Apache CassandraTM training course provides an overview of the following:
- Fundamentals of big data and NoSQL databases
- Cassandra and its features
- Architecture and data model of Cassandra
- Installation, configuration, and monitoring of Cassandra
- Hadoop ecosystem of products around Cassandra
Apache CassandraTM course is popularly used in many companies across the world. Cassandra is in the forefront of the NoSQL origination, providing higher scalability and thus allowing companies to store petabytes of data.
Cassandra is carefully designed to handle huge workloads across multiple datacenters, with not a single point of failure. This feature of Cassandra provides enterprises with greater database availability and performance. Cassandra is an open-source project, which allows you to use the source code and modify it as per your organizational requirements.
As per the latest survey, the job prospective for Cassandra is growing at the rate of 300%.
- Describe the need for big data and NoSQL
- Explain the fundamental concepts of Cassandra
- Describe the architecture of Cassandra
- Demonstrate data model creation in Cassandra
- Use Cassandra database interfaces
- Demonstrate Cassandra database configuration
- Able to demonstrate their expertise in the fast-growing big data industry
- Well-placed to benefit from the growing demand in NoSQL databases
- Able to benefit from the shortage of Cassandra trained professionals
- Well-equipped to take their organization towards big data analytics using Cassandra
- Experienced in tools used to process huge amounts of data
- In the forefront of the big data technology, which is expected to be in demand for the next ten years
The Apache CassandraTM certification helps IT professionals add weight to their profile, and typically earn more compared to their non-certified peers. The certified professionals can look for career paths such as Senior Software Professional, IT Consultants, Lead Software Professionals, and Database Administrators.
The value of Apache CassandraTM has increased sharply in the recent years and this certification is fast becoming an entry requirement for a majority of IT-based roles. According to a recent survey, the average salary of a certified professional in this field ranges from $75,000 to $132,000.
Apache Cassandra®- Course Agenda
Lesson 0 – Course Overview
- Course Overview
- Target Audience
- Course Prerequisites
- Value of Professionals
- Lessons Covered
Lesson 1 — Introduction to big data and No-SQL Databases
- 3Vs of big data
- Data evolution and characteristics of big data
- Industry Examples
- Apache Hadoop – HDFS and Mapreduce
- Brewer’s CAP principle
- Types of NoSQL database databases
Lesson 2 — Introduction to Cassandra
- Introducing Cassandra
- Cassandra Origin
- Main features of Cassandra
- Cassandra Example and Command line interface
- Cassandra Advantages and limitations
- PuTTY
- WinSCP
- Demo: Installing Ubuntu VM and connecting with Putty
Lesson 3—Architecture of Cassandra
- Cassandra Architecture
- Cassandra write process
- Cassandra read process
- Data partitions and replication
- Network topology and snitches
- Gossip protocol and seed nodes
- Virtual nodes and token generator
- Failure scenarios
Lesson 4—Installation and Configuration of Cassandra
- Cassandra versions
- Preparation for installation
- Installation of Cassandra
- Configuration of Cassandra
- Installing on CentOS
- Demo: Installing and configuring Cassandra
Lesson 5 — Cassandra Data Model
- Keyspaces, tables and columns
- Cassandra DDL: CREATE TABLE, ALTER TABLE, DROP TABLE
- Cassandra DML: INSERT, UPDATE, SELECT, DELETE
- SELECT statements restrictions
- Demo 1 – Data Definition and Data Manipulation Statements
- Demo 2 – Create a Table with Composite Key
- Demo 3 – Collection Columns in Cassandra
Lesson 6 – Cassandra Interfaces
- Cassandra Command Line Interface
- CQLSH options
- Java interfaces
- Compiling a Java Program for Cassandra
- ODBC interface
Lesson 7—Advanced Architecture and Cluster Management
- Partitions
- Replication strategy
- Consistency
- Monitoring and Administration tools
Lesson 8—Hadoop Ecosystem around Cassandra
- Apache Storm, Storm Architecture, Data Model, and Topology
- Apache Kafka, Kafka Architecture, Data Model
- Real -time analysis platform
- Apache Spark, Spark Architecture
- Spark and Scala Classrom/LVC Discussion items:
- Question Papers
- Lab Exercises
- Case study
RESOURCES
Getting started |
Operating Cassandra |
Cassandra Architecture |
Cassandra’s Tools |
Data Modeling |
Troubleshooting |
Cassandra Query Language |
Cassandra Development |
FAQs |
Configuration |