Big Data Engineering (5 op)
Toteutuksen tunnus: TT00CN70-3004
Toteutuksen perustiedot
Ilmoittautumisaika
04.12.2024 - 14.01.2025
Ajoitus
14.01.2025 - 30.04.2025
Opintopistemäärä
5 op
Toteutustapa
Lähiopetus
Yksikkö
Tekniikka ja liiketoiminta
Toimipiste
Kupittaan kampus
Opetuskielet
- Englanti
Paikat
0 - 80
Opettaja
- Tommi Tuomola
Ajoitusryhmät
- Ryhmä 1 (Koko: 35. Avoin AMK: 0.)
- Ryhmä 2 (Koko: 35. Avoin AMK: 0.)
Ryhmät
-
PTIETS23deaiData Engineering and Artificial Intelligence
-
PTIVIS23IData Engineering and Artificial Intelligence
Pienryhmät
- Ryhmä 1
- Ryhmä 2
- 14.01.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 21.01.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 21.01.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 21.01.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 28.01.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 28.01.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 28.01.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 04.02.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 04.02.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 04.02.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 11.02.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 11.02.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 11.02.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 25.02.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 25.02.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 25.02.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 04.03.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 04.03.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 04.03.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 11.03.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 11.03.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 11.03.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 18.03.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 18.03.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 18.03.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 25.03.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 25.03.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 25.03.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 01.04.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 01.04.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 01.04.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
- 08.04.2025 12:00 - 14:00, Theory, Big Data Engineering TT00CN70-3004
- 08.04.2025 14:00 - 15:00, Exercise, Big Data Engineering TT00CN70-3004
- 08.04.2025 15:00 - 16:00, Exercise, Big Data Engineering TT00CN70-3004
Tavoitteet
After completing the course the student can:
- describe basic solutions for data architectures and big data
- select and use suitable data architecture
- apply ETL process and tools for handling of big data
Sisältö
Architecture and Components of Big Data Frameworks
ETL process with Big Data for batch and streaming
Practical work with suitable tools and frameworks
Oppimateriaalit
Teacher provided lecture material
Supporting public online material
All needed material (or at least a link to them) will be available in itslearning.
Opetusmenetelmät
Contact learning, practical exercises, independent study
Tenttien ajankohdat ja uusintamahdollisuudet
There's an exam in April, re-exam in May.
Pedagogiset toimintatavat ja kestävä kehitys
Given examples and exercises support each topic studied during the lectures. Additional material in the form of tutorials and reliable information sources is provided.
Opiskelijan ajankäyttö ja kuormitus
Contact hours 44 h
Independent studying 91h, including:
- Studying the course material
- Completing exercises
- Exam
Sisällön jaksotus
-The basic idea of big data engineering methods and pipelines
-different components and processes
-integration of said components (MQ systems)
-data engineering frameworks (Apache family)
-The goal of the course is to be able to build a data pipeline from start to finish and to understand both the process and the different components and their role.
Viestintäkanava ja lisätietoja
Itslearning and contact classes are the main communication channels used on this course.
The student is required to have a computer capable of running a simple Ubuntu virtual machine and basic skills to work with Ubuntu command line.
Arviointiasteikko
H-5
Arviointimenetelmät ja arvioinnin perusteet
Homework exercises returned throughout the course
Small exam at the end of the course
11 sets of exercises, each worth 4 points, max 44 points.
Exam, max 22 points.
Required minimum to pass:
33 points in total.
22 points from the exercises.
11 points from the exam.
Hylätty (0)
< 33 points
Or < 22 points from the exercises
or < 11 points from the exam
Arviointikriteerit, tyydyttävä (1-2)
33-46 total points.
Arviointikriteerit, hyvä (3-4)
47-59 total points.
Arviointikriteerit, kiitettävä (5)
60-66 total points.