Datasets for journalists

https://ippsr.msu.edu/public-policy/state-networks

http://people.psych.cornell.edu/~jec7/data.htm

A century of UK general elections.

On Monday, the British government published a dataset of voting results, by party and parliamentary constituency, for every UK general election since 1918 — merging modern data with a handful of historical sources. — Data is Plural: August 21, 2019

Links:

https://researchbriefings.parliament.uk/ResearchBriefing/Summary/CBP-8647

https://www.scp.byu.edu/data/iceberg/

https://docs.google.com/spreadsheets/d/1gVHNx4kzXd947AFfQGiJg5zJrdNXrM81t2OC8UJFnw8/edit

Tags: environment mapping

North America

North American ecoregions.

In order to develop its maps of North American ecoregions, the US Environmental Protection Agency consulted with other federal agencies and state agencies, plus the governments of Canada and Mexico. Each “ecoregion” is an area with “similarity in the mosaic of biotic, abiotic, terrestrial, and aquatic ecosystem components with humans being considered as part of the biota.” The maps are available both as PDFs and as geospatial data files, at four levels of increasing specificity. [h/t Brandyn Friedly] — Data is Plural: July 10, 2019

Links:

https://www.epa.gov/eco-research/ecoregions

https://www.gov.uk/government/collections/statistics-higher-education-graduate-employment-and-earnings

https://irma.nps.gov/DataStore/Reference/Profile/2225713

Tags: environment mapping

global

Three centuries of taxation.

For 220 countries between the 1750s and 2018, the Tax Introduction Dataset tracks “the year of the first permanent introduction at the national level of government of six major taxes, as well as on the top statutory tax rate for that year.” The six taxes are those on personal income, corporate income, inheritance, and general sales, plus VATs and compulsory social security contributions. [h/t Philipp Heimberger + Laura Seelkopf] — Data is Plural: June 12, 2019

Links:

http://tid.seelkopf.eu/

Europe

Ethnonationalism.

Christina Isabel Zuber and Edina Szöcsik’s Ethnonationalism in Party Competition dataset compiles ratings for more than 200 political parties in 22 European countries. Experts rated the parties twice — first in 2011, and then again in 2017 — on a range of factors, such as the centrality of ethnonationalism to the parties’ platforms, and their positions on territorial autonomy for minorities. (Dataset access requires providing a name and email address.) [h/t Erik Gahner] — Data is Plural: January 30, 2019

Links:

http://christinazuber.com/data/

https://www.imf.org/en/Publications/WP/Issues/2017/04/03/Fiscal-Crises-44795

https://www.getthedata.com/open-units

https://data.cityofnewyork.us/Health/DOHMH-Dog-Bite-Data/rsgh-akpg

Tags: animals injury

global

Cattle, buffaloes, horses, sheep, goats, pigs, chickens, and ducks.

Last month, an international team of researchers published the third major version of their Gridded Livestock of the World dataset, which estimates the global distribution of cattle, buffaloes, horses, sheep, goats, pigs, chickens and ducks. The new dataset is based on 2010 statistics and provides estimates at “a spatial resolution of 0.083333 decimal degrees (approximately 10 km at the equator).” — Data is Plural: November 28, 2018
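As a rough check on the quoted resolution, the sketch below converts 0.083333 decimal degrees into kilometers at the equator. It assumes an equatorial circumference of about 40,075 km, a figure that does not come from the entry, and the function name is purely illustrative.

```python
# Assumption (not from the entry): Earth's equatorial circumference is ~40,075 km.
EQUATORIAL_CIRCUMFERENCE_KM = 40_075.0


def degrees_to_km_at_equator(degrees: float) -> float:
    """Convert an arc measured along the equator from degrees to kilometers."""
    return degrees / 360.0 * EQUATORIAL_CIRCUMFERENCE_KM


# Side length of one grid cell at the resolution quoted in the entry.
cell_km = degrees_to_km_at_equator(0.083333)
print(f"{cell_km:.1f} km per grid cell side at the equator")
```

Under that assumption the cell side comes out a little over 9 km, in line with the "approximately 10 km" in the quoted description. Cells get narrower east to west as you move away from the equator.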

Links:

Tags: agriculture animals

global

International labor treaties.

Bilateral labor agreements regulate the migration of workers between two countries, and the Bilateral Labor Agreements Dataset aims to catalog as many of these treaties as it can. So far the University of Chicago Law School professors and researchers running the initiative have identified 582 treaties signed between 1945 and 2015. “However, this list is almost certainly underinclusive,” they write. “Many BLAs are not deposited in the major international treaty databases and they often do not receive much, if any, publicity.” [h/t Adam Chilton] — Data is Plural: November 28, 2018

Links:

https://www.law.uchicago.edu/bilateral-labor-agreements-dataset

http://stapi.co/

Tags: entertainment movies television

global

To swerve or not to swerve.

A recent study revealed the results of “the Moral Machine, an online experimental platform designed to explore the moral dilemmas faced by autonomous vehicles.” The experiment asked participants to decide whether a self-driving car — faced with two deadly options — should stay on course (killing one group of pedestrians) or swerve (killing another). The project “gathered 40 million decisions in ten languages from millions of people in 233 countries and territories,” and a dataset containing every decision is available to download. Read more: “Should a self-driving car kill the baby or the grandma? Depends on where you’re from.” [h/t Walt Hickey] — Data is Plural: November 7, 2018

Links:

Tags: technology transportation

Europe

European protests, 1980 to 1995.

A team led by University of Kansas professor Ron Francisco has collected and codified data on protests, strikes, and other “coercive acts” in dozens of European countries during the late 20th century. There’s a row for each day of each protest, and each row specifies the issue at stake, the organizers, their target, the type of action, and the location — as well as the number of protesters, arrests, injuries, and deaths. [h/t Alexandre Léchenet] — Data is Plural: November 7, 2018

Probably-fake political committees.

When the Federal Election Commission receives a registration form that contains “questionable information” from a candidate or committee, the agency asks for additional information. If the FEC doesn’t get a proper response, it adds the registration to its dataset of “unverified filers.” Among the 500+ registrations currently on the list: “VoldemortCantStopTheVote.org,” “Department of Treasury,” “Wookie PAC,” and “Al Pacino.” [h/t Chris Zubak-Skees] — Data is Plural: October 31, 2018

Links:

https://www.fec.gov/data/advanced/?tab=filings

Tags: elections

North America > USA

Coal cleanup funds.

What happens when coal mines shut down? Money for their cleanup is supposed to be ensured by a system of bonds. But when Climate Home News’ Mark Olalde investigated these remediation funds, he found “a system incapable of dealing with large-scale bankruptcies, amid a declining industry, which severely threatens the environment and future of coal-mining communities across the country.” You can download the data behind Olalde’s findings — including bond databases covering the “23 states that produce 99% of US coal,” obtained via public records requests. [h/t Megan Darby] — Data is Plural: October 31, 2018

Links:

Tags: energy environment

North America > USA

Electric utilities.

The U.S. Energy Information Administration uses Form EIA-861 to collect annual data from thousands of electric utilities about their sales, revenue, peak loads, customer counts, energy efficiency savings, and more. More than 3,400 utilities submitted the form (or its shorter cousin, EIA-861S) for 2017, and the data go back to 1990. [h/t Jordan Wirfs-Brock] — Data is Plural: October 31, 2018

Links:

https://www.eia.gov/electricity/data/eia861/

https://data.cityofnewyork.us/Transportation/Parking-Meters-GPS-Coordinates-and-Status/5jsj-cq4s

Tags: transportation

North America > USA

Medical marijuana in the Nutmeg State.

Connecticut’s Department of Consumer Protection has released a dataset listing all branded medical marijuana products registered with the state. For each of the nearly 4,000 products so far, the dataset describes the producer, brand name, form of dosage, and chemical potencies — plus links to images of each product and label. [h/t Kristin Hussey] — Data is Plural: October 10, 2018

Links:

https://data.ct.gov/Health-and-Human-Services/Medical-Marijuana-Brand-Registry/egd5-wb6r/data

http://fdotewp1.dot.state.fl.us/rightofway/DownloadData.aspx

https://pollofpolls.eu/

https://www.wider.unu.edu/project/sapi-social-assistance-politics-and-institutions-database

Tags: United Nations aid statistics

North America > USA

Family life.

The National Survey of Family Growth, run by the U.S. Centers for Disease Control and Prevention, “gathers information on family life, marriage and divorce, pregnancy, infertility, use of contraception, and men’s and women’s health.” Versions of the survey have been conducted nine times, dating back to 1973. The most recent results come from interviews of more than 10,000 people between September 2013 and September 2015. Related: The Pudding’s Amber Thomas used the data to explore trends in birth control. Bonus: Thomas also published the code and data behind her analysis. [h/t Giuseppe Sollazzo] — Data is Plural: September 26, 2018

Links:

Tags: family healthcare

Local lobbying.

Some cities — including San Francisco, Los Angeles, and Austin — provide downloadable databases of lobbyists who’ve officially registered to influence their administrations. Chicago has gone one step further, publishing data on lobbyists’ compensation, expenditures, gifts, and more. Previously: Lobbying data from the U.S. House, U.S. Senate, and European Union (DIP 2017.05.31 + DIP 2017.08.02). [h/t Alisha Green and Laurenellen McCann] — Data is Plural: September 26, 2018

Links:

Tags:

Europe > Netherlands

Urban archaeology.

https://dataverse.unc.edu/dataverse/harris

http://manufacturingmap.nikeinc.com/

Tags: business mapping

space

Rocket launches.

SpaceX’s API provides data on the company’s rockets, launchpads, launches, and more. It also will tell you the current orbital position of the car SpaceX launched into space. [h/t Mike Allred] — Data is Plural: August 15, 2018

Links:

Tags: business technology

North America > USA

Peer-to-peer loans.

The Lending Club, which matches borrowers with investors, publishes a dataset of all loans issued through its platform since 2007. The dataset’s many fields include each loan’s amount, term, interest rate, grade, status, and purpose (as a category, and often also a fuller description), as well as the borrower’s employer, home ownership status, and annual income. You can also download all declined loans, i.e., those “that did not meet Lending Club's credit underwriting policy.” [h/t Charlie Stanton] — Data is Plural: August 15, 2018

Links:

https://www.lendingclub.com/info/download-data.action

https://data.kcmo.org/Traffic/Kansas-City-Monthly-Car-Auction/32xf-gvw8

Tags: transportation

global

Anthony Bourdain’s travels.

Christine Zhang has compiled a CSV of 400+ locations featured in Anthony Bourdain’s No Reservations, The Layover, and Parts Unknown shows. The spreadsheet-as-remembrance includes each location’s name, country, latitude/longitude, plus the relevant episode’s show, season, number, and title. — Data is Plural: July 4, 2018

Links:

Tags: entertainment food mapping television

North America > USA

Regional Medicare usage.

The U.S. Centers for Medicare & Medicaid Services publishes a series of “geographic variation” spreadsheets, which cover hundreds of metrics — such as kidney dialysis usage, the total cost of medical tests, and hospital readmission rates — related to Medicare beneficiaries’ healthcare in each state, county, and “hospital referral region.” [h/t Drew Ivan] — Data is Plural: July 4, 2018

Links:

https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Geographic-Variation/GV_PUF.html

https://www.du.edu/korbel/sie/research/chenow_navco_data.html

https://aaronclauset.github.io/parental-leave/

Tags: education family

North America > USA

State campaign finance laws.

The nonpartisan Campaign Finance Institute has launched a database of current and historical state campaign finance laws. The information goes back to 1996 and describes each state’s contribution limits, various kinds of prohibitions, disclosure rules, and more. You can download the full dataset or explore it online. [h/t Rachel Shorey] — Data is Plural: April 18, 2018

Links:

Tags: elections money

North America > USA

Executive orders.

The U.S. Office of the Federal Register publishes structured data on every presidential executive order since 1994. For each of the 886 entries, the dataset provides the order’s title, the date it was signed, the president who signed it, and where to find it in the Federal Register. [h/t u/cavedave] — Data is Plural: April 18, 2018

Links:

https://www.federalregister.gov/executive-orders

http://www.openpowerlifting.org/data.html

https://www.diver.orr.noaa.gov/deepwater-horizon-nrda-data

Tags: disaster environment

North America > USA

High-profile sexual assault timelines.

Rebecca Zisser and Lazaro Gamio at Axios have compiled a timeline of alleged sexual assaults by Harvey Weinstein, Bill O'Reilly, Roger Ailes, Donald Trump, and Bill Cosby. For each of the 140+ cases recorded as of Oct. 20, the timeline indicates the year of the assault, the year the victim came forward (if they did), and the year of any legal settlement (if there was one). The underlying data is available as a spreadsheet. [h/t Mike Allen] — Data is Plural: November 1, 2017

Links:

https://docs.google.com/spreadsheets/d/10CWJHTzvGtkQgyz5bdkolz1KeZLqq7sYqNA3zQaPlYo/view#gid=1175970372

http://mocap.cs.cmu.edu/

https://storms.ngs.noaa.gov/

https://fred.stlouisfed.org/release?rid=199

https://securegrants.neh.gov/open/data/

https://www.eia.gov/petroleum/supply/monthly/

Tags: business energy

global

Brain scans.

The Open Access Series of Imaging Studies (OASIS) project is “aimed at making MRI data sets of the brain freely available to the scientific community,” with the goal of “[facilitating] future discoveries in basic and clinical neuroscience.” So far, the project has published two collections: a cross-sectional dataset of scans from 416 people, ages 18 to 96; and a longitudinal dataset, based on 150 people ages 60 to 96, each of whom was scanned at least twice. [h/t Andrew Beam] — Data is Plural: August 16, 2017

Links:

http://www.oasis-brains.org/

Tags: healthcare

North America > USA

global

Global economic forecasts.

The International Monetary Fund’s World Economic Outlook Database contains the fund’s projections for future “national accounts, inflation, unemployment rates, balance of payments, fiscal indicators, trade for countries and country groups” and commodity prices. (They predict that farm-bred Norwegian salmon will cost $6.79/kg in 2022.) The database also contains historical observations for many of the economic indicators back to 1980. [h/t David Mihalyi] — Data is Plural: July 19, 2017

Links:

https://www.imf.org/external/pubs/ft/weo/2017/01/weodata/index.aspx

https://github.com/google-research-datasets/coarse-discourse

Tags: books entertainment language movies technology television

North America > USA

“The watch list Chicago police fought to keep secret.”

The Chicago Sun-Times has obtained and published an August 2016 copy of the Chicago Police Department’s “Strategic Subject List,” a database that scores nearly 400,000 (unnamed) people on a scale from 10 to 500, based on an algorithm that attempts to estimate their risk of being involved in gun violence (either as a shooter or a victim). The database includes demographic, geographic, criminal history, and other information about the people it ranks. “But the database doesn’t indicate — and the police won’t say — how much weight is given to each factor in computing the scores, which are produced using an algorithm developed at the Illinois Institute of Technology,” according to the Sun-Times. — Data is Plural: May 17, 2017

Links:

http://chicago.suntimes.com/politics/what-gets-people-on-watch-list-chicago-police-fought-to-keep-secret-watchdogs/

https://www.fueleconomy.gov/feg/download.shtml

https://www.nsf.gov/awardsearch/download.jsp

https://www.archives.gov/open/dataset-amendments.html

https://vpic.nhtsa.dot.gov/

https://www.icpsr.umich.edu/icpsrweb/content/NCAA/data.html

https://www.nist.gov/srd/nist-special-database-18

http://www.bankofengland.co.uk/research/Pages/onebank/threecenturies.aspx

https://www.buzzfeed.com/johntemplon/help-us-map-trumpworld

https://www.data.mil/s/v2/data-stories-an-overview-of-thor/a100cd16-c2a7-453b-8ea6-45947c1bbc51/

https://bison.usgs.gov/

Tags: animals plants statistics

global

Petroleum rig counts.

Since the 1940s, oilfield services corporation Baker Hughes and its predecessor companies have been publishing “rig counts” — the number of rigs actively drilling for oil and/or gas in various parts of the world. These days, the company updates its North America numbers every week and its international counts every month. As of December 16, they counted 637 rigs in — and offshore of — the United States, nearly half of them in Texas. [h/t Jordan Wirfs-Brock] — Data is Plural: December 21, 2016

Links:

Tags: energy history

North America > USA

The Affordable Care Act, quantified.

Last week, the U.S. Department of Health and Human Services released a dataset of state-level Obamacare metrics. The dataset is divided into five main categories: coverage gains, employer coverage, individual market coverage, Medicaid, and Medicare. Between 2010 and 2015, the proportion of Nevadans without health insurance dropped from 22.6% to 12.3% — the largest percentage-point decrease of any state. (In 2015, an estimated 17.1% of Texans still didn’t have health insurance, the highest rate of any state that year.) The metrics come from various sources, including the Census, academic studies, and the department’s own estimates. [h/t Nadja Popovich] — Data is Plural: December 21, 2016
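The entry describes Nevada's change in percentage points, which is not the same thing as a relative (percent) change. The sketch below works through both using only the two uninsured rates quoted above; the variable names are illustrative.

```python
# Uninsured rates for Nevada as quoted in the entry (percent of residents).
rate_2010 = 22.6
rate_2015 = 12.3

# Percentage-point change: simple difference between the two rates.
point_drop = round(rate_2010 - rate_2015, 1)

# Relative change: the drop expressed as a share of the starting rate.
relative_drop_pct = round(point_drop / rate_2010 * 100, 1)

print(f"Drop of {point_drop} percentage points")
print(f"Relative decline of {relative_drop_pct}% from the 2010 rate")
```

Keeping the two measures separate matters when comparing states: a state that starts with a low uninsured rate can post a large relative decline while moving only a few percentage points.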

Links:

https://aspe.hhs.gov/compilation-state-data-affordable-care-act

https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Information-on-Prescription-Drugs/2015MedicareData.html

Tags: drugs healthcare

global

Classical music, annotated.

“MusicNet is a collection of 330 freely-licensed classical music recordings, together with over 1 million annotated labels indicating the precise time of each note in every recording, the instrument that plays each note, and the note's position in the metrical structure of the composition.” [h/t Lon Riesberg] — Data is Plural: December 7, 2016

Links:

http://homes.cs.washington.edu/~thickstn/musicnet.html

Tags: audio entertainment music

North America > USA

STEM surveys.

The IPUMS Higher Ed portal provides data from three “leading surveys for studying the science and engineering (STEM) workforce in the United States.” The surveys currently cover 1993 through 2013 and include questions about educational choices, demographics, employment outcomes, and more. Requires a free account. [h/t Michael A. Rice, a teacher at Ingraham High School in Seattle] — Data is Plural: December 7, 2016

Links:

https://highered.ipums.org/highered/

Tags: education science statistics

North America > USA

Chicago cab rides.

Last month, Chicago’s city government published data on more than 100 million local taxi rides taken in the city since 2013. (The city gathers the data through “periodic reporting by two major payment processors believed to cover most taxis in Chicago.”) The dataset contains each ride’s start/end times, pickup/dropoff location (based on Chicago’s “community areas”), distance, cost, payment type, and taxi company. Related: “Analyzing 1.1 Billion NYC Taxi and Uber Trips, with a Vengeance,” which contains pointers to similar data for New York City. [h/t Dan Nguyen] — Data is Plural: December 7, 2016

Links:

Tags: transportation

North America > USA

Solar panels.

The Open PV Project is a “community driven, comprehensive database” of solar panel installations in the U.S., ranging from home installations to utility-scale projects. The database, run by the Department of Energy, contains more than 1 million installations — with a total capacity of 16,000+ megawatts — and tracks their locations, sizes, costs, installers, and other variables. [h/t Dad] — Data is Plural: December 7, 2016

Links:

https://openpv.nrel.gov/index

https://data.austintexas.gov/Public-Safety/Declared-Dangerous-Dogs/ykw4-j3aj

http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/23380?classification=ICPSR.IX.&q=&sortBy=7

http://www2.ed.gov/about/inits/ed/edfacts/data-files/index.html

Tags: education

North America > USA

Congressional Research Service reports, in bulk.

State-level results.

Perhaps better known for its campaign-finance data, the Federal Election Commission also publishes official state-level results for presidential, House, and Senate elections going back to 1982. The results include all official candidates, and sometimes even write-ins (depending on the state). In the 2008 presidential election, eight Rhode Island voters wrote in “Stephen Colbert,” five scribbled “Joe the Plumber,” and seven chose “Jesus.” — Data is Plural: September 28, 2016

Links:

http://www.fec.gov/pubrec/electionresults.shtml

http://www.cdc.gov/brfss/

Tags: HIV and AIDS healthcare statistics

North America > USA

Minimum wages.

Researchers at the Washington Center for Equitable Growth have compiled a dataset of current and historical minimum wages in America. The federal and state minimum-wage data stretches back to May 1974 — when the federal minimum was $2.00 per hour, roughly equivalent to $9.76 per hour in today’s dollars — while the data for cities and counties starts in January 2004. [h/t Ben Casselman] — Data is Plural: September 14, 2016

Links:

Tags: economics history money

North America > USA

The federal fleet.

The U.S. General Services Administration publishes an annual dataset about vehicles owned and leased by the federal government. The spreadsheets — which contain details on total inventories, cost, usage, and fuel consumption — go back to fiscal year 2011. In FY 2015, federal vehicles drove 4.8 billion miles, down about 9% from FY 2011. [h/t John Templon] — Data is Plural: September 7, 2016

Links:

http://www.gsa.gov/portal/content/102943

http://www.earthstat.org/

Tags: agriculture

North America > USA

Fatal car crashes.

https://data.gov.au/dataset/a8e3c0bc-44ac-4e9a-8b3c-b779438ddb10

http://www.who.int/immunization/monitoring_surveillance/data/en/

http://www.jonathanmpowell.com/coup-detat-dataset.html

https://www.st.nmfs.noaa.gov/commercial-fisheries/commercial-landings/

Tags: United Nations statistics

Africa

Farmers in Africa.

Between 2002 and 2004, researchers surveyed more than 9,500 farming households in 11 African countries to better understand how climate change might affect agricultural practices. Last month, they published the detailed results and documentation in Scientific Data. The dataset includes responses to questions about plantings, harvests, yields, water sources, animal purchases, taxes paid, and much more. — Data is Plural: June 1, 2016

Links:

http://www.nature.com/articles/sdata201620?WT.ec_id=SDATA-201605

Tags: agriculture climate statistics

North America > USA

Veterans in America.

In 2014, approximately 22 million U.S. military veterans were still alive, including 1 million who served in World War II, 7.2 million who served during the Vietnam War era, and 3.9 million who have served in post-9/11 wars. Those numbers come from the VA’s National Center for Veterans Analysis and Statistics, which publishes estimates and future-projections of the country’s veteran population. You can explore the data by age, race, ethnicity, gender, military branch, state, county, era of service, and more. (To see the files, click on the “Population Tables” header.) [h/t Charles Worthington] — Data is Plural: June 1, 2016

Links:

http://www.va.gov/vetdata/Veteran_Population.asp

Tags: healthcare

North America > USA

The Ku Klux Klan, 1915–1940.

Scholars at Virginia Commonwealth University have identified and mapped the locations of 2,000 KKK branches active in the early 20th century. The dataset contains the city, state, earliest-known-date, and sources for each “klavern.” Related: “Active Hate Groups in the United States in 2015,” a report by the Southern Poverty Law Center. [h/t K Reed] — Data is Plural: May 4, 2016

Researchers have analyzed 15 years of satellite imagery to create a nearly-global dataset of seasonal cloud coverage. The data — available at a kilometer-square resolution — could help scientists monitor and predict changes in ecosystems. [h/t Grant Smith + Joanna Klein] — Data is Plural: April 20, 2016

Links:

http://www.earthenv.org/cloud.html

Tags: history

global

Digital black markets.

Researcher Gwern Branwen has assembled an archive of listings posted to “dark net markets.” Silk Road is the best-known among the group, but the collection covers scores of other markets, including Amazon Dark and FreeBay. The materials gathered from each site are slightly different; many include product advertisements and seller profiles. Warning: Some of the archives contain pictures, which may include offensive or disturbing imagery. And it’s probably wise to heed Gwern’s caveats: The scrapes “are large, complicated, redundant, and highly error-prone. They cannot be taken at face-value.” [h/t Mike Sconzo] — Data is Plural: March 30, 2016

Links:

http://www.gwern.net/Black-market%20archives

http://www.ers.usda.gov/data-products/county-typology-codes.aspx

http://britains-diet.labs.theodi.org/

http://www.matthewfuhrmann.com/datasets.html

Tags: energy

global

Movie chatter.

The Cornell Movie-Dialogs Corpus contains 220,579 “conversational exchanges” between 9,035 characters in 617 movies. Included: “Hello. My name is Inigo Montoya. You killed my father. Prepare to die.” — Data is Plural: February 3, 2016

Links:

http://www.cs.cornell.edu/~cristian/Cornell_Movie-Dialogs_Corpus.html

http://www.dhs.gov/tsa-claims-data

Tags: environment

North America > USA

State Department per diems.

When State Department employees travel on official business abroad, they can get reimbursed — to a point — for lodging, meals, and things such as laundry. The department publishes monthly spreadsheets of the maximum per diems, which vary by location. The highest right now? The Cayman Islands ($735 per day). The lowest? Antarctica ($0/day) and Iraq ($11/day). — Data is Plural: January 13, 2016

Links:

https://aoprals.state.gov/content.asp?content_id=233&menu_id=78

Tags: statistics

North America > USA

Retirees’ language preferences.

Last year, more than 2 million people applied for new Social Security retirement and survivor benefits. When they did, they indicated their preferred language. More than 93% said English, and about 5% of applicants said Spanish — the second most popular choice. Among the 88 other options: 1,616 applicants chose American Sign Language, 32 chose Japanese, nine chose Yiddish, and one chose Swedish. — Data is Plural: January 13, 2016

Links:

https://www.ssa.gov/open/data/LEP-Yearly-Spoken-Language-RSI-Claimants.html

http://advisory.mtanyct.info/LPUWebServices/CurrentLostProperty.aspx

https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Information-on-Prescription-Drugs/

Tags: business drugs healthcare

North America > USA

New Orleans slave sales, 1856–1861.

A new study in the American Economic Review suggests that slaveholders in the South underestimated the odds of “emancipation without compensation.” To reach its conclusions, researchers compiled a dataset of 15,377 slave sales, culled from remarkably detailed official records. Data for each sale includes demographic information about the slaves, seller, and buyer; the price paid; payment method; and researcher notes. — Data is Plural: December 30, 2015

Links:

https://www.aeaweb.org/articles.php?doi=10.1257/aer.20131483

Tags: history race slavery

global

The emojiverse.

The Unicode Consortium publishes a big ol’ HTML table of every emoji, how they look in various contexts, and when they entered the canon. The “Christmas tree” emoji occupies code point U+1F384, and was introduced in 2010. (“Menorah with nine branches” arrived in 2015.) [h/t Ben Collins] — Data is Plural: December 23, 2015
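The code point quoted above can be checked against the Unicode character database bundled with Python. This is a minimal sketch using only the standard library; it looks up a single character and does not touch the Consortium's HTML table.

```python
import unicodedata

# Code point quoted in the entry for the "Christmas tree" emoji.
tree = chr(0x1F384)

# Format the code point in the usual U+XXXX notation and look up its official name.
code_point_label = f"U+{ord(tree):04X}"
official_name = unicodedata.name(tree)

print(code_point_label, official_name)
```

The same lookup works for any emoji in the table, provided the installed Python ships a Unicode database recent enough to include that character.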

Links:

http://unicode.org/emoji/charts/full-emoji-list.html

Tags: technology

North America > USA

Little’s big tree maps.

The Forest Service has digitized many of the tree species distribution maps from Elbert Little's “Atlas of United States Trees,” first published in the 1970s. Shapefiles and PDFs are available for more than 600 species — including Ilex opaca (American holly) and Pseudotsuga menziesii (Douglas fir). — Data is Plural: December 23, 2015

Links:

http://esp.cr.usgs.gov/data/little/

https://github.com/fivethirtyeight/data/tree/master/tarantino

http://www.license.state.tx.us/licensesearch/licfile.asp

Tags: statistics

North America > USA

Good FOOD, bad food.

The CDC’s Foodborne Outbreak Online Database (FOOD) contains 18,000+ outbreaks, which resulted in 358,000+ illnesses and 13,000+ hospitalizations, from 1998 through last year. In 2008, a multi-state Salmonella Saintpaul outbreak hospitalized 308 people — the highest count in the database. — Data is Plural: December 9, 2015

Links:

http://wwwn.cdc.gov/foodborneoutbreaks/

http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html

http://shootingtracker.com/wiki/Main_Page

http://www.cabq.gov/abq-data

https://blog.wikimedia.org/2015/09/25/wikipedia-editor-numbers/

Tags: social media statistics technology

North America > USA

What police-related data does your city publish?

The Police Open Data Census, created by Code for America fellows in Indianapolis, is tracking “currently available open datasets about police interactions with citizens in the US,” including officer-involved shootings, use of force, and citizen complaints. The census currently covers 36 police departments. Related: The NYPD says it will start tracking all officer use-of-force incidents — not just gunfire — next year, the New York Times reports. — Data is Plural: October 21, 2015

Links:

Tags: crime justice

Dark-web screenshots.

North America > USA

Interconnecting roads.

North America > USA

How states relate.

North America > USA

State immigration laws.

South America > Brazil

Amazonian deforestation.

Bug fixes.

North America > USA

Deaths on the job.

Europe > United Kingdom

London bike infrastructure.

City street speeds and travel times.

Protected lands.

North America > USA

Rah, rah, rah! Fight, fight, fight!

Drama.

North America > USA

Publicly funded patents.

Africa > Central African Republic

CAR refugees.

Malaria geography.

Migrant deaths around the world

Movie shots.

Europe > Germany

German federal judges.

North America > USA

Congressional whip counts.

Citations and self-citations.

Multinational corporations.

Confidence.

Europe > United Kingdom

A century of UK general elections.

A decade of TV news words.

Historical terrorist groups.

Oil and gas.

TED talks.

European electricity.

Black tech conferences.

Airports and runways.

Europe > United Kingdom

140 years of London theatre.

North America > USA

State liquor prices.

Antarctic icebergs.

Hospitals, from Angola to Zimbabwe.

North America > USA

Federal judges.

North America > USA

Opioid distribution.

space > Russia

Soviet space dogs.

Europe > United Kingdom

UK ministerial resignations.

Patent geography.

Talk radio transcripts.

Foreign military trainings.

Europe > United Kingdom

Welsh shipping crews.

Drought conditions.

The height of the frozen world.

Hydro, streams, and rivers.

Bodies of water.

Child marriage rates around the world

Inter- and intra-national boundaries.

North America > USA

Foreign lobbyists.

International arbitration.

Two decades of UN Security Council debates.

Four decades of wildlife trade.

Ballparks.