Andrew Doss

77 Followers

Published in The Inner Join

·Pinned

The NFL Combine in Two Dimensions

Exploring data with Principal Component Analysis — Cool, but… We recently published an analysis of pandemic unemployment and drinking where we used a data analysis technique called Principal Component Analysis (PCA) to visualize several variables in one plot. …

Data Science

11 min read

Published in The Inner Join

·Pinned

What Makes Simone Biles the GOAT?

An analysis of elite women’s all-around gymnastics scores. Simone Biles vs. Everyone: The ’96 Bulls or the 2017 Warriors. Tom Brady vs. Joe Montana. LeBron or MJ. Naming the “Greatest of All Time” is a sure source of speculation and controversy. In data science terms, we can’t make “all-else-equal” GOAT comparisons between athletes. We can’t control for factors like the team around…

Data Science

8 min read

Published in The Inner Join

·Pinned

Making a Simple Data Pipeline Part 1: The ETL Pattern

Schedule Python and SQL scripts to keep your dataset clean and up-to-date in a Postgres database. Want to try it yourself? First, sign up for bit.io to get instant access to a free Postgres database. Then clone the GitHub repo and give it a try! The problem: Public and private data sources are plentiful but also problematic: Source data may get updated frequently but require substantial preparation before…

Data Engineering

7 min read

Published in The Inner Join

·Dec 2, 2021

How Our Interests Changed During the Pandemic

Here’s how U.S. Google searches for 350+ common hobbies diverged from pre-pandemic expectations. Things have changed: Social distancing and sourdough starters. Masks and Mario games. Remote work and renovating. The pandemic has changed the way we’ve lived since March 2020, and that includes our hobbies. …

Data Science

6 min read

Published in The Inner Join

·Oct 5, 2021

Making a Simple Data Pipeline Part 4: CI/CD with GitHub Actions

Use GitHub Actions to automatically integrate ETL changes and republish your datasets. Want to try it yourself? First, sign up for bit.io to get instant access to a free Postgres database. Then clone the GitHub repo and give it a try! An actionable workflow: Pipeline maintenance can be tedious at best and error-prone at worst. Once a pipeline is in service, it requires ongoing…

Data Science

10 min read

Published in The Inner Join

·Sep 28, 2021

Making a Simple Data Pipeline Part 3: Testing ETL

Maintain data quality and catch bugs before your stakeholders — Want to try it yourself? First, sign up for bit.io to get instant access to a free Postgres database. Then clone the GitHub repo and give it a try! Are we done yet? In Part 1: The ETL Pattern and Part 2: Automating ETL, we completed a minimal, yet usable, data pipeline for ETL. …

Data Science

8 min read

Published in The Inner Join

·Sep 7, 2021

Make Your Own Air Quality Logger

Log IoT data to a cloud database using Python on a Raspberry Pi. Measure what matters to you: Years ago, I got back into C++ because I couldn’t sleep at night. I lived in a century-old apartment building in Seattle where my unit had the thermostat that controlled the boiler for the entire building. …

IoT

8 min read

Published in The Inner Join

·Aug 31, 2021

Is West Coast Air Getting Dirtier?

Analyzing particulate matter trends in three major Pacific metros. The data for this article was analyzed with the help of bit.io, a standards-compliant cloud Postgres database. bit.io is the fastest way to get your data into a private, hosted Postgres database. Follow bit.io on Twitter at @bitdotioinc. Wildfire woes: Wildfires are getting worse by nearly every metric. Fires in the West…

Air Quality

8 min read

Published in The Inner Join

·Aug 24, 2021

Making a Simple Data Pipeline Part 2: Automating ETL

Schedule Python and SQL scripts to keep your dataset clean and up-to-date in a Postgres database. Want to try it yourself? First, sign up for bit.io to get instant access to a free Postgres database. Then clone the GitHub repo and give it a try! Where we left off: In Making a Simple Data Pipeline Part 1: The ETL Pattern, we explained that the Extract, Transform, Load (ETL) process is…

Data Science

7 min read

Published in The Inner Join

·Jul 27, 2021

Decathletes: The Track and Field Generalists

Comparing Rio Olympic decathletes to their specialist counterparts. Specialists vs. generalists: Olympic medals can be decided by fractions of seconds. Each event requires a specific blend of genetics, training, and luck to have a shot at the podium. To reach the upper echelons of a sport, athletes typically specialize, often down to the event level. …

Olympics

5 min read

Engineer/Data Scientist @ bit.io

Following
  • Cassie Kozyrkov
  • Daniel Liden
  • Adam Fletcher
  • Xavier Amatriain
  • Oz Nova
