20 DAYS | 140 HOURS TRAINING PROGRAMME
ONLINE OR FACE-TO-FACE TRAINING
INTRODUCTION
Big data is a term used to describe large, complex data sets that are difficult to process using traditional data processing methods. Big data is a rapidly growing field, and new applications for big data are being developed all the time. As big data becomes more widely used, it is likely to have a significant impact on businesses, governments, and society as a whole. It is characterised by its volume, velocity, variety, and veracity.
Volume refers to the sheer size of big data sets. Big data sets can be terabytes, petabytes, or even exabytes in size.
Velocity refers to the speed at which big data is generated and collected. Big data sets can be generated in real time, or they can be collected over a period of time.
Variety refers to the different types of data that can be included in big data sets. Big data sets can include structured data, unstructured data, and semi-structured data.
Veracity refers to the accuracy and reliability of big data sets. Big data sets can be noisy or incomplete, which can make it difficult to draw accurate conclusions from them.
Big data is often used to analyse trends, make predictions, and improve decision-making. Some examples of big data applications include:
Fraud detection - Big data can be used to identify patterns of fraudulent activity, such as credit card fraud or insurance fraud.
Customer segmentation - Big data can be used to segment customers into groups based on their demographics, interests, and purchase history. This information can then be used to target customers with specific marketing campaigns.
Product development - Big data can be used to collect feedback from customers and identify trends in product usage. This information can then be used to improve existing products or develop new products.
EXAMPLES OF BIG DATA
Social media data - This includes data from Twitter, Facebook, Instagram, LinkedIn, TikTok, Threads and other social media platforms.
Sensor data - This includes data from sensors that are used to monitor physical processes, such as temperature, humidity, and traffic flow.
Log data - This includes data from servers, applications, and devices.
Financial data - This includes data from stock markets, banks, and other financial institutions.
WHAT YOU WILL LEARN
Hadoop and Apache Spark - two of the most popular big data frameworks. Hadoop is a distributed file system that is used to store and manage big data sets. Apache Spark is a unified analytics engine that can be used to process big data sets in real time.
Business Intelligence mindset - the ability to think critically about data and use it to make informed decisions. It involves understanding the different types of data, how to collect and store data, and how to analyse data to identify trends and patterns.
Data Optimisation - the skills that are used to improve the performance of big data applications such as data compression, data deduplication, and data caching.
Emerging Technology Synthesis - the ability to understand and apply new technologies to big data problems. This includes technologies such as artificial intelligence, machine learning, and natural language processing.
Solution Architecture - the process of designing and implementing big data solutions. This includes tasks such as identifying the business requirements, designing the data architecture, and selecting the right big data technologies.
System Integration - the process of connecting different big data systems together. This includes tasks such as data mapping, data transformation, and security configuration.
WHO THIS BOOTCAMP IS FOR
WHO THIS BOOTCAMP IS FOR
This bootcamp focuses towards engineers from various fields and are:
People who are new to big data - If you are new to big data, this bootcamp will give you a comprehensive introduction to the field. You will learn about the different big data technologies, how to use them to solve real-world problems, and how to develop a career in big data.
People who want to change careers - If you are looking to change careers into big data, this bootcamp can help you get the skills that you need. You will learn about the most in-demand big data skills, and you will have the opportunity to build a portfolio of projects that you can use to show potential employers.
People who want to advance their careers - If you are already working in big data, this bootcamp can help you advance your career. You will learn about the latest big data technologies, and you will have the opportunity to network with other professionals in the field.
WHAT YOU WILL NEED
To take this course, you will need a computer with a working internet connection... and commitment to invest your time in up skilling your self in this new and exciting technologies.
HOW THIS COURSE WILL BENEFIT YOU
This bootcamp focuses towards engineers from various fields and are:
Learning the fundamentals of big data such as data collection, storage, processing, and analysis. This will give you a solid foundation in big data and prepare you for more advanced topics.
Learning the latest big data technologies such as Hadoop, Spark, or Hive. This will give you the skills you need to work with big data in the real world.
Developing your big data skills through hands-on exercises and projects. This will give you the experience you need to succeed in a big data career.
This masterclass can give you a competitive edge in the job market. As the demand for big data professionals grows, employers will be looking for candidates with the skills and experience.
JOB OPPORTUNITIES
Data Scientist - Data scientists are responsible for collecting, cleaning, and analysing large datasets. They use their skills to extract insights that can be used to improve business decisions.
Big Data Engineer - Big data engineers are responsible for building and maintaining big data infrastructure. They work with Hadoop, Spark, and other big data technologies to ensure that data is stored and processed efficiently.
Business Intelligence Analyst - Business intelligence analysts use data to help businesses make better decisions. They work with data visualisation tools to create reports and dashboards that help businesses understand their data.
Data Architect - Data architects design and implement big data solutions. They work with stakeholders to understand the business needs and then design solutions that meet those needs.
Data Analyst - Data analysts use data to solve business problems. They work with data to identify trends, patterns, and anomalies. They then use this information to help businesses make better decisions.
GENERAL COURSE GUIDE
The bootcamp will be taught by experienced engineers who will help students learn the skills they need to be successful in the technology/engineering industry. It is divided into 6 sections which are outlined below. Breakdown schedule of each section:
COURSE OUTLINE
(THIS IS A 20 DAYs BOOTCAMP PROGrAMME)
1. FUNDAMENTALS OF HADOOP & APACHE SPARK
Course Objective:
The objective of this course is to provide students with the knowledge and skills they need to use Hadoop and Apache Spark to process and analyse large datasets. By the end of the course, students will be able to:
Understand the principles of big data processing
Install and configure Hadoop and Apache Spark
Write MapReduce and Spark applications
Use Hadoop and Apache Spark to process and analyse large datasets
Learning Outcomes:
By the end of this course, students will be able to:
Understand the concepts and fundamentals of cyber-security and its significance in the digital age.
Identify the various forms of cybercrime and their implications for individuals, organisations, and society.
Evaluate the key principles and strategies involved in cyber-security management.
Comprehend the nature and impact of cyber terrorism, including internet radicalisation and terrorist use of the internet.
Analyse the cyber-terrorism framework and its implications for national and international security.
Examine real-world case studies to understand the practical application of cyber-security principles.
Prerequisites:
Basic understanding of computer science
Basic understanding of programming
HADOOP & APACHE SPARK OUTLINE:
The course will be divided into the following modules:
Introduction to Big Data
What is big data?
The challenges of big data processing
The benefits of big data processing
Hadoop
Introduction to Hadoop
The Hadoop Distributed File System (HDFS)
MapReduce
YARN
Apache Spark
Introduction to Spark
Spark SQL
Spark Streaming
Spark MLlib
Programming with Hadoop
Pig
Hive
HBase
Hands-on Hadoop
Installing Hadoop
Creating a Hadoop cluster
Running MapReduce jobs
Using Pig
Using Hive
Using HBase
Spark vs. Hadoop
The differences between Spark and Hadoop
When to use Spark
When to use Hadoop
Spark Streaming
Introduction to Spark Streaming
Working with live data
Building streaming applications
Spark MLlib
Introduction to Spark MLlib
Machine learning algorithms
Building machine learning models
Hands-on Spark
Installing Spark
Creating a Spark cluster
Running Spark jobs
Using Spark Streaming
Using Spark MLlib
Hadoop Security
Introduction to Hadoop security
Securing Hadoop clusters
Best practices for Hadoop security
Hadoop Administration
Introduction to Hadoop administration
Managing Hadoop clusters
Troubleshooting Hadoop problems
Hadoop Use Cases
Real-world use cases for Hadoop
How Hadoop is being used to solve big data problems
Hadoop in the Cloud
Running Hadoop in the cloud
Benefits of running Hadoop in the cloud
Challenges of running Hadoop in the cloud
Conclusion
Summary of the course
Next steps for learning more about Hadoop and Spark
Assessment:
The course will be assessed through a combination of quizzes and exercises.
2. DECISION & BUSINESS INTELLIGENCE
Course Objective:
The objective of this course is to provide students with the knowledge and skills they need to develop a business intelligence mindset. By the end of the course, students will be able to:
Understand the importance of business intelligence in today's business world
Identify the different types of business intelligence
Apply business intelligence techniques to solve business problems
Communicate the results of business intelligence analysis
Learning Outcomes:
By the end of this course, students will be able to:
Define business intelligence and explain its importance in today's business world
Identify the different types of business intelligence
Apply business intelligence techniques to solve business problems
Communicate the results of business intelligence analysis
Develop a business intelligence mindset
Prerequisites:
Some basic understanding of data analysis
Some basic understanding of statistics
DECISION & BUSINESS INTELLIGENCE OUTLINE:
The course will be divided into the following modules:
Introduction to Business Intelligence
What is business intelligence?
Why is business intelligence important?
The different types of business intelligence
Data Analysis for Business Intelligence
Data mining
Statistical analysis
Predictive analytics
Business Intelligence Tools
Microsoft Excel
Tableau
Applying Business Intelligence to Solve Business Problems
Customer churn analysis
Sales forecasting
Inventory optimisation
Communicating the Results of Business Intelligence Analysis
Creating reports
Presenting data
Developing a Business Intelligence Mindset
The importance of asking questions
The importance of visualising data
The importance of storytelling
Assessment:
The course will be assessed through a combination of quizzes and exercises.
3. DATA OPTIMISATION
Course Objective:
The objective of this course is to provide students with the knowledge and skills they need to optimise data for better decision-making. By the end of the course, students will be able to:
Understand the importance of data optimisation
Identify the different techniques for data optimisation
Apply data optimisation techniques to real-world datasets
Communicate the results of data optimisation analysis
Learning Outcomes:
By the end of this course, students will be able to:
Define data optimisation and explain its importance
Identify the different techniques for data optimisation
Apply data optimisation techniques to real-world datasets
Communicate the results of data optimisation analysis
Develop a data optimisation mindset
Prerequisites:
Have completed the above sections.
DATA OPTIMISATION OUTLINE:
The course will be divided into the following modules:
Introduction to Data Optimisation
What is data optimisation?
Why is data optimisation important?
The different types of data optimisation
Data Cleaning and Preparation
Data cleaning
Data preparation
Data Compression
Data compression techniques
The benefits of data compression
Data Sampling
Data sampling techniques
The benefits of data sampling
Data Encryption
Data encryption techniques
The benefits of data encryption
Data Visualisation
Data visualisation techniques
The benefits of data visualisation
Communicating the Results of Data Optimisation
Creating reports
Presenting data
Developing a Data Optimisation Mindset
The importance of asking questions
The importance of visualising data
The importance of storytelling
Assessment:
The course will be assessed through a combination of quizzes and exercises.
4. UNDERSTANDING NEW TECHNOLOGIES
Course Objective:
The objective of this course is to provide students with the knowledge and skills they need to understand and synthesise emerging technologies. By the end of the course, students will be able to:
Identify and understand the key concepts and technologies that are emerging
Identify the potential impact of emerging technologies on society and business
Develop skills in synthesising emerging technologies to solve real-world problems
Learning Outcomes:
By the end of this course, students will be able to:
Define emerging technologies and explain their importance
Identify the key concepts and technologies that are emerging
Analyse the potential impact of emerging technologies on society and business
Develop skills in synthesising emerging technologies to solve real-world problems
Develop a critical thinking mindset about emerging technologies
Prerequisites:
Have completed the previous sections.
UNDERSTANDING NEW TECHOLOGIES OUTLINE:
The course will be divided into the following modules:
Introduction to Emerging Technologies
What are emerging technologies?
Why are emerging technologies important?
The different types of emerging technologies
The Future of Technology
The future of artificial intelligence
The future of robotics
The future of quantum computing
The Impact of Emerging Technologies on Society
The impact of artificial intelligence on society
The impact of robotics on society
The impact of quantum computing on society
The Impact of Emerging Technologies on Business
The impact of artificial intelligence on business
The impact of robotics on business
The impact of quantum computing on business
Synthesising Emerging Technologies
How to synthesise emerging technologies
Case studies of emerging technologies
Developing a Critical Thinking Mindset
The importance of critical thinking
How to develop critical thinking skills
Conclusion
Summary of the course
Next steps for learning more about emerging technologies
Assessment:
The course will be assessed through a combination of quizzes and exercises.
5. SOLUTION ARCHITECTURE
Course Objective:
The objective of this course is to provide students with the knowledge and skills they need to design and implement IT solutions. By the end of the course, students will be able to:
Understand the principles of solution architecture
Design and implement IT solutions
Evaluate and select IT solutions
Communicate the benefits of IT solutions
Learning Outcomes:
By the end of this course, students will be able to:
Define solution architecture and explain its importance
Identify the different components of a solution architecture
Design and implement IT solutions that meet the needs of the business
Evaluate and select IT solutions that are cost-effective and scalable
Communicate the benefits of IT solutions to stakeholders
Prerequisites:
Have completed the previous sections.
SOLUTION ARCHITECTURE OUTLINE:
The course will be divided into the following modules:
Introduction to Solution Architecture
What is solution architecture?
Why is solution architecture important?
The different types of solution architectures
The Solution Architecture Process
The steps in the solution architecture process
The different roles involved in the solution architecture process
Designing IT Solutions
The different elements of an IT solution
The different design approaches
Implementing IT Solutions
The different implementation approaches
The different tools and technologies used for implementation
Evaluating and Selecting IT Solutions
The different evaluation criteria
The different selection methods
Communicating the Benefits of IT Solutions
The importance of communicating the benefits of IT solutions
The different communication methods
Conclusion
Summary of the course
Next steps for learning more about solution architecture
Assessment:
The course will be assessed through a combination of quizzes and exercises.
6. SYSTEM DESIGN & INTEGRATION
Course Objective:
The objective of this course is to provide students with the knowledge and skills they need to integrate different IT systems. By the end of the course, students will be able to:
Understand the principles of system integration
Identify the different challenges of system integration
Design and implement system integration solutions
Manage system integration projects
Learning Outcomes:
By the end of this course, students will be able to:
Define system integration and explain its importance
Identify the different challenges of system integration
Design and implement system integration solutions that meet the needs of the business
Manage system integration projects effectively
Communicate the benefits of system integration to stakeholders
Prerequisites:
Have completed the previous sections.
SYSTEM DESIGN & INTEGRATION OUTLINE:
The course will be divided into the following modules:
Introduction to System Integration
What is system integration?
Why is system integration important?
The different types of system integration
The System Integration Process
The steps in the system integration process
The different roles involved in the system integration process
Designing System Integration Solutions
The different elements of a system integration solution
The different design approaches
Implementing System Integration Solutions
The different implementation approaches
The different tools and technologies used for implementation
Managing System Integration Projects
The different project management methodologies
The different tools and techniques used for project management
Communicating the Benefits of System Integration
The importance of communicating the benefits of system integration
The different communication methods
Conclusion
Summary of the course
Next steps for learning more about system integration
Assessment:
The course will be assessed through a combination of quizzes and exercises.
7. HACKATHON
DURATION: HALF DAY | 3.5 HOURS
Hackathon Objective:
The objective of this hackathon is to leverage the skills and knowledge gained from the bootcamp courses to develop innovative solutions for creating smart cities. Participants will work in teams to develop data-driven applications that integrate various technologies, optimise data processing, and provide intelligent insights for urban planning and management.
Learning Outcomes:
By the end of the hackathon, students will be able to:
Define system integration and explain its importance
Identify the different challenges of system integration
Design and implement system integration solutions that meet the needs of the business
Manage system integration projects effectively
Communicate the benefits of system integration to stakeholders
Prerequisites:
Have completed the previous sections.
HACKATHON OUTLINE:
Hadoop and Apache Spark
Theme - Big Data Processing and Analytics for Smart Cities
Description - Participants will use Hadoop and Apache Spark to process and analyze large volumes of data collected from various sources within a city. This includes sensor data, social media feeds, transportation logs, and more.
Business Intelligence Mindset
Theme - Decision Support Systems for Urban Planning
Description - Participants will develop business intelligence solutions that enable data-driven decision-making for urban planners. They will create interactive dashboards, visualisations, and reports to provide insights on key metrics and trends related to city infrastructure, resource allocation, and citizen services.
Data Optimisation Skills
Theme - Data Cleansing and Integration for Smart Cities
Description - Participants will focus on optimising data quality and integration techniques. They will develop algorithms and workflows to clean and transform heterogeneous data sources into a unified format, ensuring accuracy and consistency for subsequent analysis and application development.
Emerging Technology Synthesis
Theme - Innovations for Sustainable Cities
Description - Participants will explore emerging technologies such as Internet of Things (IoT), artificial intelligence (AI), and machine learning (ML) to develop innovative solutions for energy optimisation, waste management, traffic control, and other sustainability challenges faced by cities.
Solution Architecture
Theme - Scalable and Secure Smart City Infrastructure
Description - Participants will design scalable and secure architectures for deploying smart city applications. They will consider factors such as data privacy, real-time processing, distributed systems, and integration with existing infrastructure, ensuring a robust foundation for their solutions.
System Integration
Theme - Seamless Integration of Smart City Components
Description - Participants will focus on integrating various systems and technologies required for a smart city ecosystem. This includes integrating data from different sources, coordinating communication between devices and applications, and ensuring interoperability among various components.
Assessment:
The course will be assessed through a combination of quizzes and exercises.
JUDGING CRITERIA
Innovation and Creativity - How unique and creative is the solution?
Impact and Relevance - Does the solution address a significant challenge faced by smart cities?
Technical Implementation - How well is the solution implemented from a technical perspective?
User Experience - Is the solution intuitive, user-friendly, and accessible?
Scalability and Sustainability - Can the solution scale to accommodate a large user base? Is it economically sustainable?
YOUR TRAINERS
Dr Khairul Anuar Abd Wahid - is a Senior Trainer at Marc & Zed. He has over 15 years of experience in the IT industry, and has worked as a trainer, lecturer, and consultant for software development, data science, machine learning, artificial intelligence, and cloud computing.
He has taught in Singapore, Malaysia, and the United States. He started his teaching career in 2007 as a lecturer for software engineering at the National University of Malaysia. In 2012, he joined a Silicon Valley startup, where he worked on developing machine learning algorithms for fraud detection.
Since 2017, Dr. Khairul has conducted training and workshops in Python, R, Machine Learning, Artificial Intelligence, Cloud Computing, and Data Science. He is also a certified Data Scientist and Machine Learning Engineer. He is a highly experienced and qualified trainer, and has a wealth of knowledge and experience in the IT industry. He is passionate about teaching and helping others to learn, and is committed to providing high-quality training that meets the needs of his clients.
Dr. Khairul is a valuable asset to the Marc & Zed Training team, and his expertise in data science and machine learning is highly sought after by businesses in Singapore and Malaysia. He is a passionate educator who is committed to helping others learn and grow.
Djoshkun Diko - has been working as a developer, trainer, coach, and consultant in software engineering since 2008. His expertise includes FullStack, DevOps, Cloud Computing (Amazon Web Services & Google Cloud Platform), PHP, JavaScript, C++, Laravel, Docker, Kubernetes, Golang, VueJS, Python, Shell scripting, HTML5/CSS, MySQL, MariaDB, PostgreSQL, MSSQL Server, Cassandra, and MongoDB.
Throughout his career as a Software Architect/developer/trainer, he has been involved in designing and executing distributed system architecture principles and patterns for applied machine learning products. He has contributed to various projects involving technologies such as Laravel, Symfony, Prestashop, NodeJS, ExpressJS, VueJS, MySQL, MongoDB, PostgreSQL, Camunda Microservices architecture with gRP, GoLang/Python & Echo (Go framework), Flask & Panda libraries (Python Framework), Angular, Docker & Kubernetes, and JIRA & Confluence (Atlassian products).
During his freelance career, he has collaborated with several companies, developing web pages, web shops, and forums using platforms such as Joomla, Wordpress, vBulletin, MyBB, and HTML.
In 2017, he joined Marc & Zed SPACES in Kuala Lumpur as an Assistant Trainer. Although he left Marc & Zed in 2019, his interest in the training field brought him back in February 2020 as a Principal Trainer and Coach. In this role, he conducts hybrid trainings in Singapore, Germany, and Malaysia. He has also taken on web development projects for Marc & Zed, including developing their own website and creating a CMS website for propertysifu.com.my, and providing training for their staff. Currently, he is working on developing a website and providing training for another client of Marc & Zed, Cameron Adams UK Ltd., a real-estate agency.
Dr Harjinthar Singh - is a Principal Trainer at Marc & Zed. He has over 25 years of experience in the IT industry, and has worked as a trainer, lecturer, and consultant for software development, product design, user interface, user experience, data analysis, RDBMS, video and image editing, and mobile development.
Dr. Singh has taught in Singapore, the United Kingdom, Malaysia, and Australia. He started his teaching career in 2001 as a lecturer for software engineering at London South Bank University. In 2012, he joined a Malaysian government agency, MIMOS Berhad. From 2016 to 2017, he taught Software & Mobile Development for undergraduates, staff re-training programmes, and post-graduates intending to pursue a career as programmers and developers.
Since 2017, Dr. Singh has conducted training and workshops in UI/UX, Interaction Design, Design Thinking, DevOps, MERN FullStack, Agile, JIRA, Git/GitLab, MySQL, MS SQL Server 2016, Infographics, Graphics/Video, and mobile/web development. He is also a certified Scrum Master and Product Owner.
Dr. Singh is a highly experienced and qualified trainer, and has a wealth of knowledge and experience in the IT industry. He is passionate about teaching and helping others to learn, and is committed to providing high-quality training that meets the needs of his clients.
OR E-MAIL FOR DETAILS AT janice@marcnzed.com
OR CALL +6012 451 4977 (MALAYSIA) OR +65 9052 3859 (SINGAPORE)
Certificate
Upon successful completion of the course, participants will be awarded a verified certificate issued by Universiti Kuala Lumpur [Advancement & Continuing Education (ACE) UNIKL] and co-signed by Marc & Zed SPACES
RELATED COURSES/PACKAGES