Data Scientist with M.A. in sociology, B.A. in environmental sociology, and 5+ years' experience teaching statistics. Completed TripleTen's 10 month data science bootcamp and a real-world data science externship with DataSpeak. Currently accepting data analysis and statistics consulting projects May 2024.
View My LinkedIn Profile
Analyzed geo-location data to map and optimize a route around the world. Obtained and cleaned a geo-location data set of countries worldwide, then used the data to find the minimum distance between the points in Python.
Data science solution was downloaded to JSON and passed to software engineers who integrated it into a webpage. The results is a user-friendly webpage, where customers can interact with the global map.
Project won first place in the Jingle Bell Code Jam, December 2023.
A data set of longitude and latitudes for each country around the world was found open-source on Kaggle and can be accessed here.
country.csv
The dataset contains information on country abbreviation, latitude, and longitude for 245 countries around the world. The data were explored for missing values, duplicates, and incorrect values. Only one missing value was found, and it was dropped from the data. Data were otherwise in good condition and ready for analysis.
This problem is a take on the ‘Traveling Sales Man’ problem to optimize the route between multiple locations.
Concorde library in Python optimizes the route between many locations, with quick and efficient calculations.
Haversine distance to calculates ‘as the bird flies’ while taking into account the curvature of the earth.
First, a random model was fit as a baseline to find distance and time it would take Santa without data. This model chooses the next city randomly (if santa chose off vibes), and calculates the Haversine distance along the random route:
Random Route Results:
Random Start city: Antarctica
Random Route Order: ‘Antarctica’, ‘France’, ‘Australia’, ‘Bolivia’, ‘United States’, ‘Greenland’, ‘Afghanistan’, ‘Russia’, ‘Central African Republic’
Random Route Distance: 46977.53 miles
Random Route Time: 36.14 Hours
With Concorde library, Santa’s route was optimized to the shortest distance possible. This library was developed specifically to solve the Traveling Sales Man problem and can quickly and efficiently solve the best route between locations:
Optimized Route Results:
Optimal Start city: Afghanistan
Optimized Route Order: ‘Afghanistan’, ‘Russia’, ‘Australia’, ‘Antarctica’, ‘Bolivia’, ‘United States’, ‘Greenland’, ‘France’, ‘Central African Republic’
Optimal Distance: 31087 miles
Optimized Time: 23.91 hours
Time to Calculate Solution: 0.03 seconds
Santa has a big night ahead of him, delivering presents around the globe. Merry Mailers wants to help him optimize his trip! To create a model of his trip, nine locations were chosen from around the world: United States, Bolivia, Greenland, Antarctica, France, Central African Republic, Afhanistan, Russia, and Australia.
First, a random route was chosen as a baseline (if santa chose his next location off vibes). In his sub-optimal route, santa had to travel 50,000 miles and needed 35 hours to make it around the world. He wouldn’t have made it in time!
With Merry Mailer’s specialized route optimization algorithm, we were able to take santa’s trip down to 31,000 miles and just under 24 hours. This algorithm used state-of the art Concorde library and Haversine distance to minimize santa’s distance delivering from one house to another and speedily found an optimal solution in 0.03 seconds! We’re taking the pressure off santa by telling him to travel in this order: Afghanistan, Russia, Australia, Antarctica, Bolivia, United States, Greenland, France, Central African Republic. With our modeling sofware, santa save 20,000 miles and can feel good about making it everywhere he needs to go.
What’s next for Merry Mailers? We’re committed to follow up with each of our clients. We’ll measure santa’s success this season, and maybe add some more locations into our optimizer for next year. And it looks like we’ll be staying busy through the year - the Easter Bunny, St. Nicholas, and Tooth Fairy have been knocking at our door.