This repo introduces databases for big data and how to interact with them.
-
Open this repository in a GitHub Codespace or clone it to your local machine.
-
Install required R packages:
install.packages("renv")
renv::restore()
- Download the NYC yellow taxi dataset:
wget -O nyctaxi.csv "https://data.cityofnewyork.us/api/views/kxp8-n2sj/rows.csv?accessType=DOWNLOAD"
- Sign up for a free account at motherduck.com
Run through the quarto notebooks dplyr.qmd
and motherduck.qmd
.