Header image

Introducing Geomancer: an open-source library for geospatial feature engineering

April 16, 2019 blog-post geospatial machine-learning feature-engineering geomancer open-source

Here at Thinking Machines, we work with a lot of geospatial data: we’ve identified gaps in OpenStreetMap (OSM), provided geospatial analytics for our clients, and harnessed machine learning to estimate poverty from satellite imagery. However, we realized that we were spending too much time in repetitive feature engineering tasks. So to operate on geospatial data at scale, we decided to automate our execution and delivery workflows.

Enter Geomancer, our open-source library for geospatial feature engineering! It leverages geospatial data such as OpenStreetMap (OSM) coupled with a data warehouse like BigQuery. We use this to create, share, and iterate features for our downstream machine learning tasks. This tool allows us to:

Let’s see Geomancer in action! Given a set of points, we can create a feature that gets the distance to the nearest supermarket within a 10-km radius:

Geomancer’s Core API is powered by a SQLAlchemy backend that handles the translation of a Spell into a SQL dialect. This makes the library highly-extensible, allowing you to add new feature-primitives and database backends for your specific use-case.

We hope that Geomancer can help you scale your geospatial feature engineering needs! You can get started by reading through our getting started demo and setup guide. You can find more details through the documentation. Lastly, contributions are welcome! Simply file an issue or submit a pull request through GitHub.

Are you interested in using machine learning and geospatial data to help you and your organization make better and more informed decisions? Get in touch with us at [email protected] to learn more!

MORE STORIES

Advanced Statistical Concepts for A/B Testing and Experimentation

We tackle some of the nitty-gritty, advanced statistical concepts that must be addressed by statisticians and data scientists when adopting A/B testing in practice.

Empowering ULSSI’s Innovation Team through an AI-powered knowledge system

We worked with ULSSI to create an idea management tool that helps their Innovation team collect, discover, and organize data all in one tool.

Using AI/Big Data to Analyze Urban Mobility Patterns During the Pandemic

Can we still monitor movement under new constraints? In this second part of a blog series with ADB, we use Waze CCP mobility data to show how traffic flow has changed throughout lockdown in major Philippine cities.