Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
- Section 1: Introduction to Big Data / NoSQL
- Overview of NoSQL
- The CAP theorem
- When NoSQL is appropriate
- Columnar storage
- The NoSQL ecosystem
- Section 2: Cassandra Basics
- Design and architecture
- Cassandra nodes, clusters, and data centres
- Keyspaces, tables, rows, and columns
- Partitioning, replication, and tokens
- Quorum and consistency levels
- Labs: Interacting with Cassandra using CQLSH
- Section 3: Data Modelling – Part 1
- Introduction to CQL
- CQL data types
- Creating keyspaces and tables
- Choosing columns and data types
- Selecting primary keys
- Data layout for rows and columns
- Time to live (TTL)
- Querying with CQL
- CQL updates
- Collections (list / map / set)
- Labs: Various data modelling exercises using CQL; experimenting with queries and supported data types
- Section 4: Data Modelling – Part 2
- Creating and using secondary indexes
- Composite keys (partition keys and clustering keys)
- Time series data
- Best practices for time series data
- Counters
- Lightweight transactions (LWT)
- Labs: Creating and using indexes; modelling time series data
- Section 5: Cassandra Internals
- Understanding Cassandra design under the hood
- SSTables, memtables, and commit logs
- Section 6: Administration
- Hardware selection
- Cassandra distributions
- Cassandra node communication
- Writing and reading data to/from the storage engine
- Data directories
- Anti-entropy operations
- Cassandra compaction
- Choosing and implementing compaction strategies
- Cassandra best practices (compaction, garbage collection)
- Creating a test Cassandra instance with a low memory footprint
- Troubleshooting tools and tips
- Lab: Installing Cassandra and running benchmarks
Requirements
- Proficiency in a Linux environment (navigating the command line, editing files with vi or nano)
- For on-site courses: a laptop or desktop with at least 8 GB of RAM
- For remote courses: a functional Cassandra lab will be provided; participants only need a web browser
14 Hours
Testimonials (2)
Extensive knowledge of NoSQL environments, not only Cassandra (ex: HADOOP)
Stefan Marcoci - Videotron ltee
Course - Cassandra Administration
The 1:1 style meant the training was tailored to my individual needs.