An undergraduate course on data mining.
This project is maintained by chatox
Continuous evaluation will be based on the following elements:
To pass the course under continuous evaluation, all of the following must be true:
If you fail to pass, you will have to take the resit exam. The resit exam replaces the theory grade (B and C in the list above).
To obtain a grade in a practice session, you must:
Extra points might be added to your grade, allowing you to earn up to 12 points (instead of 10) in some practice sessions. However, your total practice grade is capped at 10 points.
The lowest grade among your practice sessions is discarded automatically, which allows you to skip one session and still get the maximum grade.
Not coming to a practice session or not delivering your work means a zero grade in that session, unless you can justify your absence to the teaching assistant (profesor de prácticas).
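The rules above (bonus points, the 10-point cap, dropping the lowest session, and zeros for missed sessions) can be illustrated with a small sketch. It assumes the total practice grade is the plain average of the remaining sessions, which this page does not state; the function name `practice_grade` is hypothetical and the official computation may differ.

```python
def practice_grade(session_grades):
    """Illustrative total practice grade.

    ASSUMPTION: the total is the unweighted mean of the session grades
    after discarding the single lowest one, capped at 10 points.
    A missed, unjustified session is entered as 0.
    """
    if len(session_grades) < 2:
        raise ValueError("need at least two sessions to discard one")
    kept = sorted(session_grades)[1:]   # drop the lowest grade
    mean = sum(kept) / len(kept)
    return min(10.0, mean)              # the total is capped at 10


# One missed session (0) is discarded; a bonus session (12) offsets an 8.
print(practice_grade([0, 8, 12, 10]))   # 10.0
print(practice_grade([0, 9, 11, 7]))    # 9.0
```

Under this assumption, bonus points can compensate for a weaker session, but they never push the total above 10.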
Do not work alone and in isolation during the practice session. Talking to someone else can prevent simple mistakes. You may work in pairs, but each student must submit their own work individually, and the submissions must be different.
Do not work in groups of three or more during the practice session. Feel free to exchange ideas with other students, but do not copy from others.
Copying from external sources without acknowledging them in your work, and copying the work of another person or group in your class, are considered by the university as serious misconduct (“falta grave”). The instructor will make a case for the university to sanction this serious misconduct, as per the university regulations, with a suspension from the university for a minimum of six months and a maximum of four years.
In-class tests and exams are individual work.
Not coming to an in-class test means a zero grade in that in-class test, unless you can justify your absence to the professor (profesor de teoría).
Copying during a test or exam, knowingly helping others to copy, and preparing, lending, or providing instruments for copying during an exam are considered by the university as serious misconduct (“falta grave”). The instructor will make a case for the university to sanction this serious misconduct, as per the university regulations, with a suspension from the university for a minimum of six months and a maximum of four years.
The exam will be taken with pen and paper.
Each practice session and assignment specifies what you should deliver.
Identify the authorship and date of each report with a paragraph on the first page, including:
All of your reports should end with the following statement:
I hereby declare that, except for the code provided by the course instructors, all of my code, report, and figures were produced by myself.
or, in the case of work done in pairs/groups (if any):
We hereby declare that, except for the code provided by the course instructors, all of our code, report, and figures were produced by ourselves.
These are some of the most common mistakes in reports; each of them deducts points from your grade:
Your code is delivered as a self-contained Python notebook. This notebook should be readable and understandable on its own by a person familiar with the course’s topic. Think of the notebook as a report in which you tell a story, and tell that story well and professionally.
Remember to identify the authorship and date of your code. Include as many markdown cells between code cells as needed to explain what you are doing and what we are looking at.
Follow good programming practices:
These are some of the most common mistakes in code; each of them deducts points from your grade:
Make sure that Kernel > Restart and run all works in your notebook, because that is how practice instructors review your code.
Delivering consistently top-quality work takes time and effort, but it can be very rewarding both personally and professionally.
Check your answers, your code, and your reports as many times as needed to ensure they are correct.
Be precise: use the section numbers (1.1, 1.2, …) of the notebook to present your results in the report, and refer to sections by their number. If you need to refer to your figures or tables, number them and refer to them by number. Do not include ambiguous statements, or plots without a scale or a legend. Do not use colors without explaining what each color means. Clearly state your assumptions and limitations.
Be careful with the presentation of your work. For instance, do not use low-quality screenshots, or poorly cropped screenshots showing toolbars and window borders. Instead, export and save high-quality images from each application. Do not copy-paste or screenshot tables into your report without converting them into actual tables.
As a data scientist, your reports and code should be (among other things) correct, understandable, pristine, clear, and pleasant to look at. Ensure you set aside enough time to review, improve, and polish your work. Get used to producing top-quality work and it will become a habit.
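One part of this review can be partly automated: checking that a notebook was saved right after a clean Kernel > Restart and run all, which is how practice instructors review your code. The sketch below uses only the standard library and relies on the fact that a notebook file is JSON whose code cells store an `execution_count`. It assumes a clean run numbers the non-empty code cells 1, 2, 3, … in order; the helper names `load_notebook` and `executed_in_order` are hypothetical. It does not replace actually running the notebook.

```python
import json


def load_notebook(path):
    """Read an .ipynb file (plain JSON) into a dictionary."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def executed_in_order(notebook):
    """Return True if non-empty code cells are numbered 1, 2, 3, ...

    ASSUMPTION: after a clean "Restart and run all", every non-empty code
    cell has an execution count equal to its position among code cells.
    """
    counts = []
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue
        # "source" may be a string or a list of strings; join handles both.
        source = "".join(cell.get("source", ""))
        if source.strip():
            counts.append(cell.get("execution_count"))
    return counts == list(range(1, len(counts) + 1))


# A tiny in-memory notebook whose second code cell was re-run out of order.
stale = {"cells": [
    {"cell_type": "markdown", "source": ["# 1.1 Load data"]},
    {"cell_type": "code", "source": ["x = 1"], "execution_count": 1},
    {"cell_type": "code", "source": ["print(x)"], "execution_count": 5},
]}
print(executed_in_order(stale))  # False
```

For a real file, `executed_in_order(load_notebook("my_practice.ipynb"))` would perform the same check, where the file name is only a placeholder.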
If you need help installing software or packages on your computer, please ask your classmates, for instance through the Aula Global forum. The teaching staff do not have the bandwidth to debug your installation.
Please use the “Notice board” for general questions, and the forums “Questions about theory” and “Questions about practices” for questions on theory and practices. I get the same questions many times by e-mail and that’s not helpful for you or for me. So, unless it’s something specific to you, use the forums.
If you do need to e-mail me or your practice instructors about something specific to you, please include the name of the subject in your e-mail and, if necessary, your seminar/practice group. I have well over 100 students this trimester, which makes it harder to remember everybody.
If you do need to e-mail me or the practice instructors, please send it from your UPF address, so we can give it priority and answer it promptly.
Feel free to ask me questions in English, Catalan, or Spanish, regardless of the language of your group.
To ask for an appointment, send an e-mail to the course’s professor. No appointments will be given in the 72 hours before partial or final exams.