The Open Data from governments are usually a wealth of information with comprehensive metadata. The minimal functions are CSV downloads, but usually there are online query and plotting functions and links to do bulk downloads.
- United States: data.gov OpenEI NYC AWS
- European Union: eurostat.eu data.europe.eu
- Canada: open.canada.ca
Energy, Petroleum and Geophysics
Oil and gas used to have a tight grip on their data, unless there were regulations for making information available. This is changing and extensive datasets are being made available to allow a wider audience to learn and come up with novel ideas.
- Data science publication by SPE: DSDE
- Worldwide energy production and consumption: BP IEA OPEC EIA Worldbank
- Reservoir and Production data:
- US Frac data: FracFocus FrackingData UCSC
- Reservoir models: Stanford VI SPE dataset PRIO
- Open Energy: Wikipedia listing
Miscellaneous Open Database
Protected lands. The UN’s World Database on Protected Areas is, it says, “the most up to date and complete source of information on protected areas, updated monthly with submissions from governments, non-governmental organizations, landowners and communities.”
City street speeds and travel times. Uber Movement, from the titular ride-hailing company, “shares anonymized data aggregated from over ten billion trips to help urban planning around the world.” Online, you can explore street speeds and estimated travel times for dozens of cities.
London bike infrastructure. Transport for London has launched its Cycling Infrastructure Database, which “contains the location of more than 240,000 pieces of cycling infrastructure in London, including places to park and the location of cycle lanes.”
Deaths on the job. Since 1992, the US Bureau of Labor Statistics’ has collected data on work-related deaths through its Census of Fatal Occupational Injuries. The results are presented as various cross-tabulations — by industry, demographic, circumstances, and more.
Bug fixes. Researchers at Brazil’s Federal University of Ceará have published a new dataset “composed of more than 70,000 bug-fix reports from 10 years of bug-fixing activity of 55 projects from the Apache Software Foundation.”