Thanks to initial work by folks at Wellesley and Wikidata work from 9of99 on Wikipedia, the Newspapers on Wikipedia project has both created an initial Wikidata set of extant U.S. newspapers and mapped that set against the pages and infoboxes that still need to be created.
The full set is here and can be queried in multiple ways:
Visually, these maps overstate needs in high-density areas, for two reasons: the red dots (needs page) take precedence over blue dots (has page) in a conflict, and the data has a geolocation that is only as granular as the town (hence Chicago has one geolocation). The data will also need continued cleanup; I’ve spotted a few issues just screenshotting regions. But this initial data set will be developed alongside the rest of the project, and even when papers don’t make it into Wikipedia, we’ll make sure the Wikidata on them is accurate, and try to match them with other sets of data as we go forward.
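To make the overstatement concrete, here is a minimal sketch of the precedence rule described above. It is a simplification, not how the maps are actually rendered: it assumes one dot per town and invents a few sample records (the town names and flags below are hypothetical, not project data).

```python
from collections import defaultdict

# Hypothetical records: (town, has_wikipedia_page). Illustrative only.
papers = [
    ("Chicago", True),
    ("Chicago", True),
    ("Chicago", False),
    ("Springfield", True),
]


def dot_colors(records):
    """Return one color per town, with red (needs page) winning any conflict."""
    by_town = defaultdict(list)
    for town, has_page in records:
        by_town[town].append(has_page)
    # A town is blue only if every paper there already has a page.
    return {
        town: "blue" if all(flags) else "red"
        for town, flags in by_town.items()
    }


colors = dot_colors(papers)
print(colors)
```

In this toy data, two of Chicago's three papers already have pages, yet the town still shows as a single red dot. That is the sense in which dense areas look needier on the map than they are.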
According to the data here (which, again, is imperfect), the current counts are:
- Has Wikipedia page and Infobox: 957
- Needs Infobox: 84
- Needs Page: 3775
(We’ve already put a dent in some of the work before this, so we’ll go back and manually tally up a baseline.)
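For a sense of proportion, the three counts above can be turned into a total and rough shares. This is just arithmetic on the numbers as listed (the variable names are mine), and it inherits whatever errors the underlying data has.

```python
# Counts copied from the list above; the data is preliminary.
counts = {
    "Has Wikipedia page and infobox": 957,
    "Needs infobox": 84,
    "Needs page": 3775,
}

total = sum(counts.values())

# Share of each category as a percentage of all papers in the set.
shares = {label: round(100 * n / total, 1) for label, n in counts.items()}

for label, n in counts.items():
    print(f"{label}: {n} ({shares[label]}%)")
print(f"Total: {total}")
```

By these numbers, the set holds 4,816 papers, and a bit over three quarters of them still need a page.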
Anyway, some maps. Keep in mind this is very preliminary.