17-08-31 Machine Learning Projects
Category: Idea Lists (Upon Request)
<!-- gdoc-inlined -->
Ideas for new projects.
Need to rate these on:
-
Value to Mission
- Understanding Internally
- Reputation effects after publication (Mindshare)
- Onboarding to Services
- Advantage of Open Source Community
-
Time to Completion
- Ease of Data Acquisition
- Difficulty
-
Production - bring a model from TF to production in a web or mobile app, asking which tools are weak / missing. Do this for whatever library people want to get behind.
- Productionize a language based product. Website recommendation engine, book recommender, etc.
-
Applications of reinforcement learning. Creative real life applications
- Data Center Optimization - Predicting Power Usage Effectiveness
- Look to replicate Google's data center work and turn it into a product.
- https://deepmind.com/blog/deepmind-ai-reduces-google-data-centre-cooling-bill-40/
- https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/42542.pdf
-
AutoML - create a scalable automated machine learning program, for MLlib or in general
-
Booster rewrite - Spark's gradient booster is weak; we should re-implement LightGBM in Spark
-
Productionizing sklearn
-
Finance Applications
- NLP for financial modeling
- Renaissance Technologies is run by researchers who came out of IBM's statistical speech recognition work
- Sequence / Time Series Modeling
- Volatility Prediction
- Volume Modeling
- (Hard) Return Modeling
- Anomaly Detection
- Financial Crisis Prediction
- Fraud Detection
- Loyalty Marketing
- Customer Marketing
- Customer Service
-
Health Care Applications
- Tumor Detection
- Tracking Tumor Development
- Blood Flow Quantification & Visualization
- Diabetic Retinopathy
- Radiology Diagnostics
- Automated Monitoring of Patients
-
Security / Attacks
- Yelp Fake Review Attack
- Adversarial Image Attack
- Inaudible Voice Commands
- Cybersecurity through DL / NLP for malware prediction
- Automated predictive modeling engine
- Model serving glue, become widely used open source / scalable as closed source
- Quant trading with advanced machine learning, replacing classic regularized linear models with more advanced models (gradient boosters, trees) to capture nonlinearities in the market (or building ensembles if it’s too easy to overfit)
- Supply chain optimization, using algos to make decisions about resource distribution and streamlining communications
- Solve matching problems. Buyer profile to perfect house matching, for example.
- People matching in a city, creating an extremely strong tool for who should meet one another
- Meta auto-selling site where a trusted user states what they are selling and the site auto-creates the relevant ads on Craigslist, posts to Facebook, etc.
- Idea matching. Semantically understand a person’s idea, and let them search for that idea in the history of human speech / creation, as well as for people thinking similar things.
- Website recommendation engine, à la Pandora. Same for articles across websites.
- Sound de-noising optimization
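Sketch for the quant trading idea above (linear models vs. gradient boosters): a toy comparison of a least-squares line against boosted decision stumps on a target the line cannot represent. Everything here is an assumption made for illustration, not part of the original notes: the synthetic data (y = |x| plus noise), the function names, and the hyperparameters (100 rounds, learning rate 0.1). It uses only the Python standard library and is not a trading model.

```python
import random

def fit_linear(xs, ys):
    """Ordinary least squares for one feature: y ~ a*x + b."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    var = sum((x - mx) ** 2 for x in xs)
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / var
    b = my - a * mx
    return lambda x: a * x + b

def fit_stump(xs, residuals):
    """Depth-1 regression tree: pick the threshold minimizing squared error."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    sx = [xs[i] for i in order]
    sr = [residuals[i] for i in order]
    n, total = len(sx), sum(sr)
    best, left_sum = None, 0.0
    for k in range(1, n):
        left_sum += sr[k - 1]
        if sx[k] == sx[k - 1]:
            continue  # cannot split between identical feature values
        right_sum = total - left_sum
        # Minimizing squared error is equivalent to maximizing this gain.
        gain = left_sum ** 2 / k + right_sum ** 2 / (n - k)
        if best is None or gain > best[0]:
            threshold = (sx[k - 1] + sx[k]) / 2
            best = (gain, threshold, left_sum / k, right_sum / (n - k))
    if best is None:  # all feature values identical: fall back to the mean
        mean = total / n
        return lambda x: mean
    _, threshold, left_value, right_value = best
    return lambda x: left_value if x <= threshold else right_value

def fit_boosted_stumps(xs, ys, rounds=100, lr=0.1):
    """Gradient boosting for squared loss: each stump fits current residuals."""
    base = sum(ys) / len(ys)
    stumps, preds = [], [base] * len(xs)
    for _ in range(rounds):
        residuals = [y - p for y, p in zip(ys, preds)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        preds = [p + lr * stump(x) for p, x in zip(preds, xs)]
    return lambda x: base + lr * sum(s(x) for s in stumps)

def mse(model, xs, ys):
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def make_data(n, rng):
    # Hypothetical nonlinear relationship, chosen only for the demo.
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    ys = [abs(x) + rng.gauss(0.0, 0.05) for x in xs]
    return xs, ys

rng = random.Random(0)
train_x, train_y = make_data(400, rng)
test_x, test_y = make_data(200, rng)

linear = fit_linear(train_x, train_y)
boosted = fit_boosted_stumps(train_x, train_y)

linear_mse = mse(linear, test_x, test_y)
boosted_mse = mse(boosted, test_x, test_y)
print(f"linear test MSE:  {linear_mse:.4f}")
print(f"boosted test MSE: {boosted_mse:.4f}")
```

On real market data the signal-to-noise ratio is far lower than in this toy, so the gap would likely shrink and overfitting becomes the main risk, which is where the ensembling caveat in the idea comes in.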
Source: Original Google Doc