Skip to main content

Data Analytics and Machine LearningLaajuus (5 cr)

Code: 3011633

Credits

5 op

Objective

After completing the course the student:
- Can define the main concepts related to data analytics and machine learning
- Understands the value and the drivers for data analytics and machine learning
- Can describe the processes of data analytics and machine learning
- Can use some tools for data analytics and machine learning

Content

Introduction to data analytics and machine learning
Data analytics process and methods
Machine learning process and methods
Practical work

Enrollment

01.12.2024 - 31.12.2024

Timing

13.01.2025 - 01.05.2025

Number of ECTS credits allocated

5 op

Mode of delivery

Contact teaching

Unit

Engineering and Business

Campus

Kupittaa Campus

Teaching languages
  • Finnish
  • English
Seats

15 - 40

Degree programmes
  • Degree Programme in Information and Communication Technology
  • Degree Programme in Business Information Technology
Teachers
  • Golnaz Sahebi
Groups
  • PTIVIS23W
    Software Development and Information Systems
  • PTIETS23swis
    Software Development and Information Systems

Objective

After completing the course the student:
- Can define the main concepts related to data analytics and machine learning
- Understands the value and the drivers for data analytics and machine learning
- Can describe the processes of data analytics and machine learning
- Can use some tools for data analytics and machine learning

Content

Introduction to data analytics and machine learning
Data analytics process and methods
Machine learning process and methods
Practical work

Materials

Course book:

Aurélien Géron.
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow
2nd Edition.
Publisher : O'Reilly Media; 2nd edition
(October 15, 2019)

We study chapters 1, 2, 3, 4, 6, 9, and 10 of the book. They have about 300 pages, but some are skipped over.

The course book can be read in electronic form from our institution's eBook Central database.

The course also has reading material, which will be announced during the course.

Teaching methods

- Participating in lectures (theory and practice)
- Learning through hands-on programming (classwork assignments)
- Completing homework assignments
- Interacting with the teacher and classmates
- Enhancing knowledge through teamwork projects
- Following the flipped-classroom model (pre-session self-study of theoretical concepts followed by in-class practical application)

Exam schedules

No exam!

International connections

- The course includes approximately 14 theory and practice sessions, where students engage with practical tasks.
- Homework exercises will be assigned, with some parts demonstrated during contact sessions.
- A teamwork project will be introduced in the second month, requiring students to apply their teamwork skills and the knowledge gained from the course to implement their final project.
- A flipped-classroom model may be used for some lectures, where students study the theoretical content at home and focus on practical implementation and discussions during class.

Completion alternatives

The practice works and exercises are mainly performed using Python and Jupyter Notebook.

Student workload

+ Student Responsibilities:
1. Class Participation and Assignments:
- Active participation in all classes, including the completion of in-class assignments, which must be submitted during class hours.
2. Homework Assignments:
- Completing 8-10 individual homework assignments, partially demonstrated during contact sessions. The exact number of the assignments will be announced at the first lecture)
3. Final Project:
- A group project (2-3 students) to be completed over Weeks 46 & 47, culminating in a presentation in Week 48.

+ Student workload:
Contact hours (approximately):
- One introductionary session: 2h
- 13 times 3h theory and practice: 13 x 3h = 39 hours
- Final projects and presentations: 24 hours
- Home work: approximately 75 hours

Total: approximately: 140 hours

Content scheduling

+ The course includes approximately 14 guided working and theory sessions, 9 personal homework assignments, 8-9 classwork assignment and a teamwork project

+ Final project is done in groups of 2-3 people outside of guidance sessions. The group sets aside 15 minutes to present the group work during the last session.

+ Content scheduling
- Week 03: Course Introduction (2h)
- Week 04: Landscape of machine learning (3h)
- Week 05: Data exploration (3h)
- Week 06: Data preparation (3h)
- Week 07: Model training, selection, and evaluation (3h)
- Week 08: Winter break - Visualization (self-study)
- Week 09: Demonstrations of Exercises 1 – 4 (3h)
- Week 10: Classification (3h)
- Week 11: Training models (3h)
- Week 12: Decision trees (3h)
- Week 13: Unsupervised learning (3h)
- Week 14: Guidance to team work (3h)
- Week 15: Introduction to Neural networks (3h)
- Week 16: Demonstrations of Exercises 5 – 9 (3h)
- Week 17: Team work presentations (3h)

Further information

+ Qualifications:
- Python programming skills and skills in utilizing Pandas for data manipulation and Numpy for numerical operations and array handling
- Basic knowledge of probability, statistics and linear algebra

+ Communication Channel:
Itslearning and email

Evaluation scale

H-5

Assessment methods and criteria

1) The course is graded on a scale of 0-5

2) Students can achieve maximum 200 points from this course that contains:
- Participation and classwork assignments: participating on each lecture and submitting the related classwork assignment during the class hours 1+2 = 3p => 9 X 4 = 36 points.
- Homework assignments: each homework assignment has 10-15 points. There are 9 homework assignments =>minimum 90 points and maximum 135 points.
- Teamwork assignment: 29 points

Assessment criteria, fail (0)

The student did NOT get at least 50% of the points in teamwork assignment OR did not get at least 50% of the points in the homework assignments OR did not get at least 50% of the points in participation and classwork submission.

Assessment criteria, satisfactory (1-2)

The student got 50-65% of the points for the homework assignments AND got 50-65% of the points for the participation and classwork assignments submission AND got 50-65% of the points for the teamwork assignment.

Assessment criteria, good (3-4)

The student got 66-85% of the points for the homework assignments AND got 66-85% of the points for the participation and classwork assignments submission AND got 66-85% of the points for the teamwork assignment.

Assessment criteria, excellent (5)

The student got at least 86% of the points for the homework assignments AND got at least 86% of the points for the participation and classwork assignments submission AND got at least 86% of the points for the teamwork assignment.

Enrollment

01.12.2023 - 17.01.2024

Timing

08.01.2024 - 30.04.2024

Number of ECTS credits allocated

5 op

Mode of delivery

Contact teaching

Unit

Engineering and Business

Campus

Kupittaa Campus

Teaching languages
  • Finnish
  • English
Seats

10 - 35

Degree programmes
  • Degree Programme in Business Information Technology
Teachers
  • Golnaz Sahebi
  • Matti Kuikka
Teacher in charge

Matti Kuikka

Groups
  • PTIETS22swis
    PTIETS22 Software Development and Information Systems

Objective

After completing the course the student:
- Can define the main concepts related to data analytics and machine learning
- Understands the value and the drivers for data analytics and machine learning
- Can describe the processes of data analytics and machine learning
- Can use some tools for data analytics and machine learning

Content

Introduction to data analytics and machine learning
Data analytics process and methods
Machine learning process and methods
Practical work

Materials

Mainly according to the book (from chapters 1-10)
[Aurélien Géron] Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, Publisher : O'Reilly Media; 2022, 3rd Edition

In addition: Material prepared by the teacher, online material and tasks in the learning environment.

International connections

The course includes approximately 12 guided working and theory sessions, 9 personal practice tasks and group work.
*
Group work is done in groups of 3-4 people outside of guidance sessions. The group sets aside 15 minutes outside of guidance sessions to present the group work.

Student workload

Contact hours:
- Course start-up: 2h
- Weeks 3 - 5: Theory & practice (3h/week): 5 x 3h = 15h
- Weeks 9 - 15: Theory & practice (3h/week): 7 x 3h = 21h
- Week 16: Project work presentations: 3h
- In addition, in weeks 4 - 15, about 10 support and inquiry hours: 10 x 1h = 10h

Total contact hours: about 51 hours

Independent study and homework: about 90 h

Content scheduling

Week 2:
- Machine learning landscape (Introduction to machine learning)
Weeks 3 - 7:
- End-to-end machine learning process
Weeks 9-15:
- Presentation of teamwork
- Classification
- Training linear models
- Decision trees
- Unsupervised learning
- Neural networks (Introduction to neural networks)
Week 16: Presentation of project works

Further information

The course materials and assignments are at ITS.
The exercises are mainly performed using Jupyter Notebook.
Communication about the course via ITS, but also via the course's Teams channel.

Evaluation scale

H-5

Assessment methods and criteria

The course is graded on a scale of 0-5.

You get points from the exercises n the contact class and homework, which affect the evaluation by 3 units.
About half of the exercises are done in contact classes.

The project work also affects the evaluation by 2 units. The project work is graded in ITS from 0 to 5, which is influenced by both the teacher's assessment and the peer assessment given by the rest of the project team.

The course can only be passed by doing both practice tasks and participating in project work.

Assessment criteria, fail (0)

The student does NOT participate in the project work or gets the project work grade 0 (Failed) OR did not get at least 40% of the points in the course exercises.

Assessment criteria, satisfactory (1-2)

The student got 40-59% of the points for the exercises in the course AND got a grade of 1 - 3 for the project work.

Assessment criteria, good (3-4)

The student got 60-84% of the points for the exercises in the course AND got a grade of 3 - 4 for the project work.

Assessment criteria, excellent (5)

The student got at least 85% of the points for the exercises in the course AND got a grade 5 for the project work.

Enrollment

30.11.2022 - 19.01.2023

Timing

09.01.2023 - 28.04.2023

Number of ECTS credits allocated

5 op

Mode of delivery

Contact teaching

Unit

Engineering and Business

Campus

Kupittaa Campus

Teaching languages
  • Finnish
Seats

20 - 35

Teachers
  • Matti Kuikka
  • Tuomo Helo
Groups
  • PTIETS21swis
    PTIETS21 Software Development and Information Systems

Objective

After completing the course the student:
- Can define the main concepts related to data analytics and machine learning
- Understands the value and the drivers for data analytics and machine learning
- Can describe the processes of data analytics and machine learning
- Can use some tools for data analytics and machine learning

Content

Introduction to data analytics and machine learning
Data analytics process and methods
Machine learning process and methods
Practical work

Evaluation scale

H-5

Enrollment

30.11.2022 - 19.01.2023

Timing

09.01.2023 - 28.04.2023

Number of ECTS credits allocated

5 op

Mode of delivery

Contact teaching

Unit

Engineering and Business

Campus

Kupittaa Campus

Teaching languages
  • English
Seats

10 - 35

Teachers
  • Golnaz Sahebi
  • Matti Kuikka
Groups
  • VAVA2223

Objective

After completing the course the student:
- Can define the main concepts related to data analytics and machine learning
- Understands the value and the drivers for data analytics and machine learning
- Can describe the processes of data analytics and machine learning
- Can use some tools for data analytics and machine learning

Content

Introduction to data analytics and machine learning
Data analytics process and methods
Machine learning process and methods
Practical work

Materials

Course book:

Aurélien Géron.
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow
2nd Edition.
Publisher : O'Reilly Media; 2nd edition
(October 15, 2019)

We read chapters 1-10 of the book of menus. They have about 300 pages, but some are skipped over.

The course book can be read in electronic form from our institution's eBook Central database.

The course also has reading material, which will be announced during the course.

International connections

The course includes approximately 12 guided working and theory sessions, 9 personal practice tasks and group work.
*
Group work is done in groups of 3-4 people outside of guidance sessions. The group sets aside 15 minutes outside of guidance sessions to present the group work.

Content scheduling

Introduction to machine learning:
- data exploration
- data processing and preparation
- model training, selection, and evaluation
- taking the model into production
- supervised learning
- unsupervised learning
- visualization

We proceed in general according to the chapters in the course book.

Evaluation scale

H-5

Assessment methods and criteria

The course is graded on a scale of 0-5.
*
In order to receive an approved performance, the student must receive an acceptable mark for both 1) personal practice tasks and 2) group work.
*
You can get at least 10 points for each practice task. You can therefore get a maximum of 90 points from all 9 practice tasks.
Personal practice tasks: 25 points -> grade 0.5; 38 -> 1.0; 50 -> 1.5; 63 -> 2.0; 75 - 2.5; 88 -> 3.0. The tasks are checked in the demos. Must be present at the demo sessions.
Teamwork final project: 0.0 - 2.0.
*
(In both cases, 0.5 is the first accepted grade)

Enrollment

11.12.2021 - 21.01.2022

Timing

10.01.2022 - 25.04.2022

Number of ECTS credits allocated

5 op

Mode of delivery

Contact teaching

Unit

Engineering and Business

Campus

Kupittaa Campus

Teaching languages
  • Finnish
Seats

0 - 50

Teachers
  • Matti Kuikka
  • Golnaz Sahebi
  • Tuomo Helo
Groups
  • PTIETS20swis
    PTIETS20 Software Development and Information Security

Objective

After completing the course the student:
- Can define the main concepts related to data analytics and machine learning
- Understands the value and the drivers for data analytics and machine learning
- Can describe the processes of data analytics and machine learning
- Can use some tools for data analytics and machine learning

Content

Introduction to data analytics and machine learning
Data analytics process and methods
Machine learning process and methods
Practical work

Evaluation scale

H-5