The datasets we’re looking at this week

You read Dates are plural, a weekly newsletter with useful/curious datasets. Below you will find the July 20, 2022, issuereprinted with permission from FiveThirtyEight.

Edition 2022.07.20

New electoral laws, celebrities, budget divisions, digital trade regulations and the World Cup.

New electoral laws. The Voting Rights Lab has been tracking more than 2,000 laws proposed in US lawmakers since 2021. The tracker focuses on “12 key issues related to voter access and representation,” such as: E.g. early voting, same day registration and ID requirements. It lists each bill’s state, number, author, enactment date, current status, and areas of concern, along with a summary and “Laboratory’s assessment of whether the legislation is likely to improve or impair voter access or the administration of elections.” As seen in: “Did your state make voting difficult?” (Thirty-Five). relationshipted: Newsroom of the States Kira Lerner has compiled a table of 120 new election-related criminal sanctions, based in part on tracker data.

Remarkable people. “A new strand of literature aims to build the most comprehensive and accurate database of notable individuals,” observe Morgane Laouenan et al., who contribute a “cross-verified database of 2.29 million individuals” extracted from Wikidata and English, French, and German became , Italian, Spanish, Portuguese and Swedish editions of Wikipedia. The record includes birth and death dates, gender, citizenship, occupations, and other details for each person. So far: The MIT-based Pantheon dataset (DIP 2016.02.03), also based on Wikipedia and since updated. [h/t Philip Jung]

budget allocations. Congress decides how much money goes to each US federal agency and program through a process called appropriations. But the Office of Management and Budget, through a process called apportionment, ultimately sets the rules for spending those funds, “usually capped[ing] the obligations [an agency] may apply to specific periods, programmes, activities, projects, objects or a combination thereof.” These binding decisions were generally off-limits to the public — until last week, when OMB launched a database of fiscal 2022 splits under a requirement in Congress’ 2022 spending bill. [h/t Caitlin Emma]

Digital Trade Reservationons. Mira Burri et al.’s TAPED dataset, which “attempts to comprehensively trace developments in digital trade governance,” categorizes more than 100 relevant aspects of more than 300 preferential trade agreements signed since 2000. that the free trade agreement between Peru and Australia contains binding agreements on the protection of personal data, non-binding language on cyber security and no provisions on net neutrality.

The world chigh. Joshua Fjelstul’s World Cup Database, released this month, provides “extensively sanitized and cross-validated” information on each of the 21 FIFA World Cup tournaments played to date. Its 27 tables contain “approximately 1.1 million data points” on the participating teams, their players and managers, the referees, match results, goals, penalties and more.

Record Suggestions? Criticism? Praise? Send feedback to [email protected] Looking for previous records? This table contains them all. Visit subscribe and browse past issues.

Leave a Comment