We will have presentations of research papers in the second half of the course. Presentations will be done in groups of 41 students, using Piazza to find teammates, where each group can indicate a ranked preference for a topic they’d like to present a paper in. A paper from the preferred topic2 will then be (randomly) assigned to the group.
Each group will only do one presentation. All members of the group will receive the same grade excepting in any special/extenuating circumstances.
Each group should collaborate to create an 8-minute presentation video of the assigned paper, based on which they will participate in a 5-6 minute live Q&A session. Research papers typically make a scientific contribution, which means that they propose or claim something that holds and that matters. The overall goal of the presentation is to convey the contribution made in the paper. For that purpose, the presentations should cover:
- Very briefly, what is the paper generally about?
- Background and motivation.
- What is proposed or claimed in the paper?
- What supporting evidence is provided?
- Why does the proposal/claim matter?
Only include as much mathematics as needed to convey the key message of the paper. Feel free to use diagrams and equations from the paper in your presentation (with proper acknowledgement).
Important Dates
- Fri 5 Feb, noon (wk 4)
- Each group should fill-in the paper presentation details form with the following information
- Student numbers and names
- Ranked topics in decreasing order of preference
- Any constraints on the date or time (e.g. if you’re in an unsuitable timezone) when the group cannot participate in the live Q&A.
- Mon/Wed, Week8…, 9am
- Each group should submit their presentation videos by 9am (UK time) the day before they are scheduled to present using the presentation slides form.
Presentation Schedule
Paper assignments and presentation schedules: https://edin.ac/3jXQRGN
Presentation Details
For the presentations, we expect that
- each team submits (only once) a single video file
- the video should be in MP4 format, and should be
- at a decent resolution (at least VGA)
- not too large—the submission form stipulates a 100Mb file size limit
- the video should be at most 8 minutes long
- the video should be in “normal” speed; i.e not sped up to fit time limit
- the first slide of your presentation should list
- paper title
- team ID (from the paper assignments)
- UUN and names of the team members no specific constraints on formatting or any other info you wish to include
- each team member should present as part of the video.
We suggest splitting into contribution, background, methods, and experiments.
Please upload the videos using the presentation slides form, naming your file appropriately as requested on the form.
There is no specific way we require you to record your presentations. You can record them however you see fit. However, some suggestions to record presentations with audio are
- using Open Broadcaster Studio (OBS)
- screen record with ffmpeg or similar command line tools
- using Office365 (Slide Show -> Record Slide Show)
- using Zoom or BBCollab
- any other method of choice (e.g. Jitsi+Dropbox)
Its entirely up to you whether you would prefer to record each members presentation individually and just concatenate/stitch them together into one video, or if you coordinate to record all members together in one go, or any other arrangement to suit.
Marking
The final grade will be based on the presentation, performance at the Q&A session, and (weighted) audience feedback on how well the presentation characterised the paper.
The marking criteria for the presentation will include
- style: fluency, coherence within team, and audience engagement
- understanding: characterise the paper faithfully and in an accessible way
- slide design: structure, key points, figures, and legibility
Furthermore, as part of evaluating engagement, for a different paper from the one they will present, but within the same topic, every student will also be required to
- indicate their choice of paper and provide a brief 1-paragraph summary along with a brief list (2-3) of questions about it, using the Engagement (Pre) form.
- attend the presentation of the paper (by the concomitant group) to subsequently provide feedback on that presentation using the Engagement (Post) form.
(Note: Each student only needs to attend the session where their chosen paper is being presented. However, you are of course welcome to also attend other sessions!)
The submission for the Engagement forms will depend on when the paper you have chosen is being presented. They will both need to be submitted on that day.
Suppose your chosen paper is being presented on Tue, 9th March 2021 (Class #1 in the schedule), then
- Engagement (Pre) should be submitted by 9am (UK Time) on 9th March
- Engagement (Post) should be submitted after class on 9th March
(only involves a few multiple-choice questions)
Note that in the majority of cases this will be in the same class that you’re presenting your team’s assigned paper in.
Adherence to these deadlines will be monitored by the course engagement tracker.
Papers
The following are the broad list of topics the groups will need to choose from. Papers listed within will be randomly assigned once topic is determined for a group.
PCA and its extensions
-
Independent Component Analysis: Algorithms and Applications
A. Hyvarinen
Neural Networks 2000 -
Robust Principal Component Analysis
E. Candes, X. Li, Y. Ma, and J. Wright
Journal of ACM 2009 -
Heterogeneous Component Analysis
S. Oba, M. Kawanabe, et al
Advances in Neural Information Processing Systems 21, 2008 -
Optimal Sparse Linear Encoders and Sparse PCA
M. Magdon-Ismail and C. Boutsidis
Advances in Neural Information Processing Systems 29, 2016 -
On Consistency and Sparsity for Principal Components Analysis in High Dimensions
I.M. Johnstone and A.Y. Lu
Journal of the American Statistical Association 2009 -
Single Pass PCA of Matrix Products
S. Wu, S. Bhojanapalli, et al
Advances in Neural Information Processing Systems 29, 2016 -
Provable Non-convex Robust PCA
P. Netrapalli, U.N. Niranjan, et al
Advances in Neural Information Processing Systems 27, 2014 -
A Generalization of Principal Components Analysis to the Exponential Family
M. Collins, S. Dasgupta, and R. E. Schapire
Advances in Neural Information Processing Systems 15, 2002 -
Semi-parametric Exponential Family PCA
S. Sajama and A. Orlitsky
Advances in Neural Information Processing Systems 18, 2005 -
Demixed Principal Component Analysis
W. Brendel, R. Romo, and C. K. Machens
Advances in Neural Information Processing Systems 24, 2011 -
Memory Limited, Streaming PCA
I. Mitliagkas, C. Caramanis, and P. Jain
Advances in Neural Information Processing Systems 26, 2013 -
Iterative Supervised Principal Components
J. Piironen and A. Vehtari
Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics (PMLR), 2018 -
Random Consensus Robust PCA
D. Pimentel-Alarcon and R. Nowak
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (PMLR), 2017 -
Demixed Principal Component Analysis of Neural Population Data
D. Kobak, W. Brendel, et al
eLife, 2016
Dimensionality reduction and data visualisation
-
Reducing the Dimensionality of Data with Neural Networks
G. Hinton and R. Salakhutdinov
Science 2006 -
On a Connection between Kernel PCA and Metric Multidimensional Scaling
C. Williams
Advances in Neural Information Processing Systems 14, 2001 -
Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data
M. Radovanovic et al
Journal of Machine Learning Research 2010 -
Nonlinear Dimensionality Reduction by Locally Linear Embedding (longer version)
L. Saul and S. Roweis
Science 2000 -
Visualizing Data using t-SNE
L. van der Maaten and G. Hinton
Journal of Machine Learning Research 2008 -
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction
L. McInnes, J. Healy, and J. Melville
Arxiv:1802.03426 2018 -
Learning the Parts of Objects by Non-negative Matrix Factorization
D.D. Lee and H.S. Seung
Nature 1999 -
Static and Dynamic Source Separation using Nonnegative Factorizations: A Unified View
P. Smaragdis, C. Fevotte, et al
IEEE Signal Processing Magazine 31, 2014 -
PARAFAC. Tutorial and Applications
R. Bro
Chemometrics and Intelligent Laboratory Systems 1997 -
Dimensionality Reduction for Data in Multiple Feature Representations
Y.Y. Lin, T.L. Liu, and C.S. Fuh
Advances in Neural Information Processing Systems 22, 2009 -
Denoising and Dimension Reduction in Feature Space
M.L. Braun, K.R. Müller, and J.M. Buhmann
Advances in Neural Information Processing Systems 20, 2007 -
Temporal Regularized Matrix Factorization for High-dimensional Time Series Prediction
H.F. Yu, N. Rao, and I.S. Dhillon
Advances in Neural Information Processing Systems 29, 2016 -
Dimensionality Reduction of Massive Sparse Datasets Using Coresets
D. Feldman, M. Volkov, and D. Rus
Advances in Neural Information Processing Systems 29, 2016 -
Isomap Out-of-sample Extension for Noisy Time Series Data
H. Dadkhahi, M. F. Duarte, and Benjamin Marlin
IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), 2015 -
Dimensionality Reduction Using the Sparse Linear Model
I. A. Gkioulekas and T. Zickler
Advances in Neural Information Processing Systems 24, 2011
Performance evaluation, hyperparameter selection
-
Image Quality Assessment: From Error Visibility to Structural Similarity
Z. Wang, A. Bovik, et al
IEEE Transactions on Image Processing 2004 -
Random Search for Hyper-Parameter Optimization
J. Bergstra and Y. Bengio
Journal of Machine Learning Research -
Practical Bayesian Optimization of Machine Learning Algorithms
J. Snoek, H. Larochelle, and R. Adams
Advances in Neural Information Processing Systems 25, 2012 -
“Why Should I Trust You?” Explaining the Predictions of Any Classifier
M.T. Ribeiro, S. Singh, and C. Guestrin -
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Minimax-optimal Semi-supervised Regression on Unknown Manifolds
A. Moscovich, A. Jaffe, and N. Boaz
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (PMLR), 2017
Missing data, outliers, and anomaly detection
-
Isolation forest {for anomaly detection}
Liu et al
Eighth IEEE International Conference on Data Mining 2008 -
Removing Electroencephalographic Artifacts: Comparison between ICA and PCA
T.P. Jung et al
Proceedings of the 1998 IEEE Signal Processing Society Workshop 1998 -
Recommender Systems: Missing Data and Statistical Model Estimation
B. Marlin et al
Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI 2011) -
LOF: Identifying Density-Based Local Outliers
M. Breunig et al
Proceedings of the ACM SIGMOD International Conference on Management of Data 2000 -
Support Vector Data Description
D. Tax and R. Duin
Machine Learning 2004 -
Efficient Direct Density Ratio Estimation for Non-stationarity Adaptation and Outlier Detection
T. Kanamori, S. Hido, and M. Sugiyama
Advances in Neural Information Processing Systems 22, 2009 -
Generalized Outlier Detection with Flexible Kernel Density Estimates
E. Schubert, A. Zimek, and H. Kriegel
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014 -
Fast Memory Efficient Local Outlier Detection in Data Streams
M. Salehi, C. Leckie, J. C. Bezdek, T. Vaithianathan, and X. Zhang
IEEE Transactions on Knowledge and Data Engineering 28, 2016 -
Outlier Detection and Trend Detection: Two Sides of the Same Coin
E. Schubert, M. Weiler, and A. Zimek
IEEE International Conference on Data Mining Workshop (ICDMW), 2015 -
In-network PCA and Anomaly Detection
L. Huang, X. Nguyen, M. Garofalakis, M. I. Jordan, A. Joseph, and N. Taft
Advances in Neural Information Processing Systems 20, 2007 -
Feature Set Embedding for Incomplete Data
D. Grangier and I. Melvin
Advances in Neural Information Processing Systems 23, 2010 -
A Denoising View of Matrix Completion
W. Wang, M. A. Carreira-Perpinan, and Z. Lu
Advances in Neural Information Processing Systems 24, 2011 -
Robust Variational Autoencoders for Outlier Detection in Mixed-Type Data
Si. Eduardo, A. Nazabal, C. K. I. Williams, C. Sutton
Advances in Neural Information Processing Systems 32, 2019 -
Plant Functional Trait Change Across a Warming Tundra Biome
A. D. Bjorkman, I. H. Myers-Smith, et al
Nature 562, 2018 -
Why Normalizing Flows Fail to Detect Out-of-Distribution Data
Polina Kirichenko, Pavel Izmailov, Andrew Gordon Wilson
Neural Information Processing Systems (NeurIPS) 2020 -
GAIN: Missing Data Imputation using Generative Adversarial Nets
Jinsung Yoon, James Jordon, Mihaela Schaar
International Conference on Machine Learning, 2018 -
Missing Data Imputation using Optimal Transport
Boris Muzellec, Julie Josse, Claire Boyer, Marco Cuturi
International Conference on Machine Learning, 2020
Fairness and Bias
-
Private traits and attributes are predictable from digital records of human behavior
M. Kosinski et al
Proceedings of the National Academy of Sciences 2013 -
Learning Fair Representations
R. Zemel et al
Proceedings of the 30th International Conference on Machine Learning 2013 -
Why Is My Classifier Discriminatory? Irene Y. Chen, Fredrik D. Johansson, David Sontag
Neural Information Processing Systems (NeurIPS) 2018 -
Improving fairness in machine learning systems: What do industry practitioners do?
Holstein, K., Vaughan, J. W., Daumé III, H., Dudík, M., & Wallach, H. (2018).
ArXiv:1812.05239, 2018 -
Equality of Opportunity in Supervised Learning
Moritz Hardt, Eric Price, Nathan Srebro
Neural Information Processing Systems (NeurIPS) 2016 -
Repair: Removing representation bias by dataset resampling
Yi Li, Nuno Vasconcelos
Computer Vision and Pattern Recognition 2019 -
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
Zhao et al
Empirical Methods in Natural Language Processing 2017 -
Convnets and imagenet beyond accuracy: Understanding mistakes and uncovering biases
Pierre Stock, and Moustapha Cisse
European Conference on Computer Vision (ECCV) 2018 -
Women also snowboard: Overcoming bias in captioning models
Hendricks et al.
European Conference on Computer Vision (ECCV) 2018 -
Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them
Hila Gonen, and Yoav Goldberg
North American Chapter of the Association for Computational Linguistics 2019 -
Compositional fairness constraints for graph embeddings
Avishek Bose, and William Hamilton
International Conference on Machine Learning 2019 -
Balanced datasets are not enough: Estimating and mitigating gender bias in deep image representations
Wang et al
International Conference on Computer Vision 2019 -
Fairness Constraints: Mechanisms for Fair Classification
Zafar et al
Artificial Intelligence and Statistics (AISTATS) 2017 -
Mitigating unwanted biases with adversarial learning
Brian Hu Zhang, Blake Lemoine, and Margaret Mitchell
Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society. 2018.