Vicky created a model dedicated to recommending pitches to the Cubs in games against the Cardinals. (Technically, this model could help any team – or any talented pitcher, quite frankly – when throwing pitches against Cardinals players, but it is specifically dedicated to her beloved Cubs.)
Vladimir Lazovskiy, data scientist working at the intersection of machine learning, content creation, and media.
In this project, Vladimir tackles the question: what is the relationship between machine learning and optimization? He explores how delivery companies can use machine learning to forecast travel times between two locations and then use a genetic algorithm to find the best travel itinerary for each delivery truck.
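As a rough illustration of the second half of that idea, the sketch below runs a small genetic algorithm over stop orderings for a single truck. It is not Vladimir's implementation: the `times` matrix simply stands in for whatever travel times a forecasting model would produce, and the function names, population size, crossover scheme, and mutation rate are all assumptions chosen for the example.

```python
import random
from typing import List, Sequence


def route_time(route: Sequence[int], times: Sequence[Sequence[float]]) -> float:
    """Total time to visit the stops in order and return to the first stop."""
    return sum(times[a][b] for a, b in zip(route, list(route[1:]) + [route[0]]))


def ordered_crossover(p1: List[int], p2: List[int], rng: random.Random) -> List[int]:
    """Keep a slice of p1 in place and fill the other slots in p2's order."""
    i, j = sorted(rng.sample(range(len(p1)), 2))
    kept = p1[i:j + 1]
    kept_set = set(kept)
    rest = [stop for stop in p2 if stop not in kept_set]
    return rest[:i] + kept + rest[i:]


def swap_mutation(route: List[int], rng: random.Random, rate: float) -> List[int]:
    """With probability `rate`, swap two stops in a copy of the route."""
    route = route[:]
    if rng.random() < rate:
        i, j = rng.sample(range(len(route)), 2)
        route[i], route[j] = route[j], route[i]
    return route


def genetic_route(times, pop_size=50, generations=200, mutation_rate=0.2, seed=0):
    """Search for a short visiting order (hypothetical helper, assumed parameters)."""
    rng = random.Random(seed)
    stops = list(range(len(times)))
    # Start from the given order plus random shuffles of it.
    population = [stops[:]] + [rng.sample(stops, len(stops)) for _ in range(pop_size - 1)]
    for _ in range(generations):
        population.sort(key=lambda r: route_time(r, times))
        elite = population[: pop_size // 5]  # the best 20% survive unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            p1, p2 = rng.sample(elite, 2)
            child = ordered_crossover(p1, p2, rng)
            children.append(swap_mutation(child, rng, mutation_rate))
        population = elite + children
    return min(population, key=lambda r: route_time(r, times))
```

Because the best routes are carried over unchanged each generation, the result is never worse than the starting order, though a genetic algorithm gives no guarantee of finding the optimal itinerary.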
Alex chose to work with music data because it is a type of audio that can evoke emotion in addition to thought. When she listens to music, she asks herself, "Why does a particular song make me feel happy or sad?" The key of a song helps determine the feeling and is made up of the tonic note and the mode. For this project, she aimed to predict the mode.
In San Francisco, human waste is a growing issue, both for the people who run into it and for the people who have no other option than to relieve themselves on public streets. Mattie built a model that predicts where and when human waste will show up, which could be used to better inform resource allocation for programs like San Francisco’s Pitstop (a program that brings portable bathrooms to areas that have high homeless populations).
Alando Ballantyne, Founder & Data Scientist, Sovereign Finance
Image analysis and classification is something that Alando is passionate about (specifically as it pertains to analyzing satellite imagery to generate economic data for emerging economies). In this post, he writes about a few of the more common pixel classification techniques used in remote sensing.
For his final project, Ankur decided to see if he could forecast hourly Uber demand across NYC neighborhoods. In addition to time-lagged features (such as previous week’s demand), he added information specific to each neighborhood to improve predictions.
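A minimal sketch of what a time-lagged feature can look like, assuming hourly demand counts for one neighborhood held in a plain list. The helper names and the `neighborhood_info` fields are hypothetical and are not taken from Ankur's project.

```python
from typing import Dict, List, Optional

HOURS_PER_WEEK = 7 * 24  # 168 hourly observations in one week


def lagged(series: List[float], lag: int) -> List[Optional[float]]:
    """Shift a series forward by `lag` steps; slots with no history are None."""
    return [None] * min(lag, len(series)) + series[: max(len(series) - lag, 0)]


def build_rows(demand: List[float], neighborhood_info: Dict[str, float]) -> List[dict]:
    """Pair each hour's demand with the demand from the same hour one week earlier."""
    rows = []
    for t, (target, prev_week) in enumerate(zip(demand, lagged(demand, HOURS_PER_WEEK))):
        if prev_week is None:
            continue  # the first week has no earlier week to look back on
        rows.append({
            "hour_index": t,
            "demand_prev_week": prev_week,
            **neighborhood_info,  # assumed static per-neighborhood features
            "target": target,
        })
    return rows
```

Each resulting row could then be fed to a regression model, with `target` as the value to predict.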
Recommender systems are an effective solution to information overload. Oren wrote an article exploring the motivation behind recommendation systems, as well as providing an overview of the characteristics and potential of various prediction techniques.
Orlando started this project to show the potential ethical conflicts created by our new algorithms. In every conceivable field, algorithms are being used to filter people. In many cases, the algorithms are obscure, unchallenged, and self-perpetuating.
Once again, the holiday season is upon us...Should you find yourself preparing the whole meal, or offering to contribute a dish or two, and in the mood for homemade culinary adventures, MenuPlannerHelper (MenuHelper for short), a little web application Heng-Ru May developed a while back, could come in handy.
Politicians have used gerrymandering, the practice of drawing political districts for partisan advantage, to skew elections since the early days of this great country...Joseph's goal was to build a tool that would let anyone optimize a map for whatever they think is most important.
Rebekah's vision is to attach a camera to the back of her bike, near the seat, that captures video in real time and alerts her to any cars approaching from behind. The alert would be an audio cue played in an app that is already running, such as Strava, Spotify, or Audible.
To combine the functionality of individual song-based playlist generators with a focus on making content-based recommendations, Zach created a web app that builds a hip-hop playlist of songs with similar lyrical meaning and mood around a song specified by the user.
Motorcycle Lean Assist uses a convolutional neural network to detect the lean angle of a motorcycle through image classification, providing you with rider feedback on your current lean angle so you don’t have to guess.
Tim's project explores the conversations about climate change that took place on Twitter in March 2017. With 1 million tweets from 560,000 users, Tim identified people belonging to different communities and used tools such as the Twitter API, Spark, NetworkX, and Gephi to derive insight from those conversations.
Naoya explores the intersection between data and art by designing a recurrent neural network utilizing Long Short-Term Memory nodes (LSTMs) to learn patterns in the Six Cello Suites by J.S. Bach and generate its own musical fragments.
To combat flaws in other recommendation systems, Will decided to use natural language processing of beer reviews to measure how similar the language used to describe different beers is. He found that the words people use to describe beer give better results than arbitrary scores or styles.
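One simple way to compare review language is cosine similarity over word counts. The sketch below is an assumption about how such a comparison could work, not a description of Will's method; the function names and the bag-of-words representation are chosen only for illustration.

```python
import math
import re
from collections import Counter
from typing import Dict, List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def most_similar(target: str, reviews: Dict[str, str]) -> List[Tuple[str, float]]:
    """Rank every other beer by how closely its review text matches the target's."""
    vectors = {beer: Counter(tokenize(text)) for beer, text in reviews.items()}
    scores = [
        (beer, cosine_similarity(vectors[target], vec))
        for beer, vec in vectors.items()
        if beer != target
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)
```

Here `reviews` maps a beer name to its combined review text, and `most_similar("Some IPA", reviews)` returns the other beers ordered from most to least similar wording. A real system would likely add steps such as stop-word removal or term weighting.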
The legality of marijuana and the public's view of it are rapidly changing as more states decriminalize and legalize the drug. As such, how have the words associated with marijuana in news articles changed over time?
You don't have to be an expert to know that password security is a big issue for companies these days. It seems every other week you hear of a well-known website getting hacked. Hasan Haq's project uses neural networks to generate "dictionary" word lists to be used in password cracking.
MemoTrek is an application that takes your travel photos as input and makes personalized recommendations for future travel destinations. It provides two types of recommendations: you-may-also-likes for a similar type of experience and something-different for new adventures.
Micheal Lai, Strategy Consultant & Data Scientist at IBM
Micheal created a system that can track players in a basketball clip and translate their positions to a coordinate grid. This kind of motion tracking already exists in the form of SportVU, but readily accessible YouTube clips can also be used to produce player tracking.
Visaurant is a reimagination of the way users search through images that they are interested in. One prime use case for Visaurant is in sorting and filtering through food images (hence VIS-ual rest-AURANT).
Who among us hasn’t fallen victim to the addictive power of a binge-worthy Netflix show? For Jamie's final project at Metis, she chose to explore elements in popular shows that might lead you to start “binge watching” on Netflix.
At a recent DataKind SF event, Ash was intrigued by the challenges faced in investigating wage theft and other labor violations, not just nationwide but also specifically in California and the Bay Area.