Skip to main content

Data Analytics and Machine LearningLaajuus (5 cr)

Code: TT00CO52

Credits

5 op

Objective

After completing the course the student:
- Can define the main concepts related to machine learning
- Understands the value and the drivers for machine learning
- Can describe the processes of machine learning
- Can use some tools for data analytics and machine learning

Content

Machine learning process and methods
Practical work

Enrollment

04.12.2024 - 13.01.2025

Timing

13.01.2025 - 30.04.2025

Number of ECTS credits allocated

5 op

Mode of delivery

Contact teaching

Unit

Engineering and Business

Campus

Kupittaa Campus

Teaching languages
  • English
Seats

0 - 60

Teachers
  • Golnaz Sahebi
Groups
  • PTIETS23deai
    Data Engineering and Artificial Intelligence
  • PTIVIS23I
    Data Engineering and Artificial Intelligence

Objective

After completing the course the student:
- Can define the main concepts related to machine learning
- Understands the value and the drivers for machine learning
- Can describe the processes of machine learning
- Can use some tools for data analytics and machine learning

Content

Machine learning process and methods
Practical work

Materials

Course book:

Aurélien Géron.
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow
2nd Edition.
Publisher : O'Reilly Media; 2nd edition
(October 15, 2019)

We study chapters 1, 2, 3, 4, 6, 9, and 10 of the book. They have about 300 pages, but some are skipped over.

The course book can be read in electronic form from our institution's eBook Central database.

The course also has reading material, which will be announced during the course.

Teaching methods

- Participating in lectures (theory and practice)
- Learning through hands-on programming (classwork assignments)
- Completing homework assignments
- Interacting with the teacher and classmates
- Enhancing knowledge through teamwork projects
- Following the flipped-classroom model (pre-session self-study of theoretical concepts followed by in-class practical application)

Exam schedules

No exam!

International connections

- The course includes approximately 14 theory and practice sessions, where students engage with practical tasks.
- Homework exercises will be assigned, with some parts demonstrated during contact sessions.
- A teamwork project will be introduced in the second month, requiring students to apply their teamwork skills and the knowledge gained from the course to implement their final project.
- A flipped-classroom model may be used for some lectures, where students study the theoretical content at home and focus on practical implementation and discussions during class.

Completion alternatives

The practice works and exercises are mainly performed using Python and Jupyter Notebook.

Student workload

+ Student Responsibilities:
1. Class Participation and Assignments:
- Active participation in all classes, including the completion of in-class assignments, which must be submitted during class hours.
2. Homework Assignments:
- Completing 8-10 individual homework assignments, partially demonstrated during contact sessions. The exact number of the assignments will be announced at the first lecture)
3. Final Project:
- A group project (2-3 students) to be completed over Weeks 46 & 47, culminating in a presentation in Week 48.

+ Student workload:
Contact hours (approximately):
- One introductionary session: 2h
- 13 times 3h theory and practice: 13 x 3h = 39 hours
- Final projects and presentations: 24 hours
- Home work: approximately 75 hours

Total: approximately: 140 hours

Content scheduling

+ The course includes approximately 14 guided working and theory sessions, 9 personal homework assignments, 8-9 classwork assignment and a teamwork project

+ Final project is done in groups of 2-3 people outside of guidance sessions. The group sets aside 15 minutes to present the group work during the last session.

+ Content scheduling
- Week 03: Course Introduction (2h)
- Week 04: Landscape of machine learning (3h)
- Week 05: Data exploration (3h)
- Week 06: Data preparation (3h)
- Week 07: Model training, selection, and evaluation (3h)
- Week 08: Winter break - Visualization (self-study)
- Week 09: Demonstrations of Exercises 1 – 4 (3h)
- Week 10: Classification (3h)
- Week 11: Training models (3h)
- Week 12: Decision trees (3h)
- Week 13: Unsupervised learning (3h)
- Week 14: Guidance to team work (3h)
- Week 15: Introduction to Neural networks (3h)
- Week 16: Demonstrations of Exercises 5 – 9 (3h)
- Week 17: Team work presentations (3h)

Further information

+ Qualifications:
- Python programming skills and skills in utilizing Pandas for data manipulation and Numpy for numerical operations and array handling
- Basic knowledge of probability, statistics and linear algebra

+ Communication Channel:
Itslearning and email

Evaluation scale

H-5

Assessment methods and criteria

1) The course is graded on a scale of 0-5

2) Students can achieve maximum 200 points from this course that contains:
- Participation and classwork assignments: participating on each lecture and submitting the related classwork assignment during the class hours 1+2 = 3p => 9 X 4 = 36 points.
- Homework assignments: each homework assignment has 10-15 points. There are 9 homework assignments =>minimum 90 points and maximum 135 points.
- Teamwork assignment: 29 points


The assignments must be returned by the deadline to get the points. The assignments returned after the deadline will give you only half of the points.
Demonstrations of exercises during the contact session is mandatory without demonstration you will lose 50% of your marks.

Assessment criteria, fail (0)

The student did NOT get at least 50% of the points in teamwork assignment OR did not get at least 50% of the points in the homework assignments OR did not get at least 50% of the points in participation and classwork submission.

Assessment criteria, satisfactory (1-2)

The student got 50-65% of the points for the homework assignments AND got 50-65% of the points for the participation and classwork assignments submission AND got 50-65% of the points for the teamwork assignment.

Assessment criteria, good (3-4)

The student got 66-85% of the points for the homework assignments AND got 66-85% of the points for the participation and classwork assignments submission AND got 66-85% of the points for the teamwork assignment.

Assessment criteria, excellent (5)

The student got at least 86% of the points for the homework assignments AND got at least 86% of the points for the participation and classwork assignments submission AND got at least 86% of the points for the teamwork assignment.

Enrollment

29.11.2023 - 18.01.2024

Timing

08.01.2024 - 30.04.2024

Number of ECTS credits allocated

5 op

Mode of delivery

Contact teaching

Unit

Engineering and Business

Campus

Kupittaa Campus

Teaching languages
  • English
Seats

10 - 50

Degree programmes
  • Degree Programme in Information and Communication Technology
  • Degree Programme in Business Information Technology
  • Degree Programme in Information and Communications Technology
Teachers
  • Golnaz Sahebi
Teacher in charge

Golnaz Sahebi

Groups
  • PTIETS22deai
    PTIETS22 Data Engineering and Artificial Intelligence
  • PTIVIS22I
    Data Engineering and AI

Objective

After completing the course the student:
- Can define the main concepts related to machine learning
- Understands the value and the drivers for machine learning
- Can describe the processes of machine learning
- Can use some tools for data analytics and machine learning

Content

Machine learning process and methods
Practical work

Evaluation scale

H-5