The second part of this course also includes a data mining mini-project. Every group (of 4 students) will be allocated a dataset to work on based on their preference.
Here is a list of suggested datasets for the mini-projects. You can also choose a dataset outside of this list, but you must clear it with the instructor by email.
Important Dates:You can find the instructions for this year's miniproject here. Please read these carefully.
Group No. | Members | Dataset |
1 |
Angus Scott Debadri Mukherjee Ziwei Peng Zhou Yu |
Predicting Cuisines of Recipes [info + check e-mail for data] |
2 |
Imran Khaliq Rajeev Ratan Cesar Juarez Ramirez Aart Meijer |
Short Term Movements in Stock Prices [info + data] |
3 |
Grant Robertson Anthony Jarvis Dan Benveniste Mustafa m Somalya |
Prediction of Gene/Protein Localization Dataset [info + data] |
4 |
Junjie Ye Jin Li Guannan Lu Qi Hu |
The Caravan Insurance Data [info + data] |
5 |
Daniel Zapata Jesus Emmanuel Vazquez Valencia Jose Sendra |
Identifying Malaria Parasites from Images [info + check e-mail for data] |
6 |
Shashank Mangla Iain McDermid Angus Taylor Matthew Gould | The Reuters-21578 Text Dataset [info + data] |
7 |
Xiaohong Zhao Shufei Zhang Qiuyu Li Yunfeng Zhu |
Web Quality Assesment [info + data] |
This page was revised by Stefanos Angelidis and is maintained by Nigel Goddard
Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail: school-office@inf.ed.ac.uk Please contact our webadmin with any comments or corrections. Logging and Cookies Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh |