Wikidata:Edit groups/OR/6ca12d684e3

From Wikidata
Jump to navigation Jump to search

Edit group OR/6ca12d684e3

Summary Batch of new Persons (mainly German scientists aktive in the years 1933 - 1945) which are not yet in Wikidata and have a GND and a birthday Author Richirikken
Number of edits 1,795 (more statistics) Example edit Q107166595

Discussion[edit]

@Richirikken:

Thank you for the cleaning --Richirikken (talk) 11:52, 10 June 2021 (UTC)[reply]

The contribution was part of a data donation I managed with @Elya:. You can find more information here: https://www.wikidata.org/wiki/Wikidata:GEPRIS_Historisch_(DFG) --Richirikken (talk) 11:55, 10 June 2021 (UTC)[reply]

@Elya, Richirikken: - why for a given GND ID, a new item was created, even if an existing item in WD already contained the GND ID? HumanAFuser (talk) 12:08, 10 June 2021 (UTC)[reply]

Hi @HumanAFuser: this was a huge upload which I did to my best knowledge. Altogether 3.255 new persons have been genarted from which 1.827 had a GND. All persons without have been professionally checked, if they have a GND or if they are already in Wikidata. The one case you found Q107176050 had a mssing GND in our data which is a mistake. I think this resulted from difficulties discerning proposals to the "Notgemeinschaft der Deutschen Wissenschaft" and the "Reichsforschungsrat" between this two person: https://gepris-historisch.dfg.de/person/5140444? https://gepris-historisch.dfg.de/person/5140442? The other duplicates where just a handful from more than 3.200 new persons. If you still find some duplicates, please let me know, I'll help to clean it up. best regrads --Richirikken (talk) 13:21, 10 June 2021 (UTC)[reply]

This doesn't answer the question. GND 117494038 is in WD since 2020-05-19 22:13 (Q94841812) [1], merged 2020-07-20 18:47 into Q59533615 [2], still 2021-06-09 Q107166380 containing GND=117494038 was created as a new item. I would assume that one checks for existence of a given GND ID before creating a new item with that GND ID. But maybe it were really only few cases. Property_talk:P227/Duplicates#human lists 378 more to check, but I don't if it contains any more GEPRIS humans. Overall: Great work, thanks a lot! HumanAFuser (talk) 13:39, 10 June 2021 (UTC)[reply]

As explained, I checked all cases for existing GND in Wikidata prior to upload. However the GND was missing on our side for the case (Q107176050), thus I was unable to detect this duplicate. You are welcome to add information to the new cases, which will find a way back in our database. Best regards --Richirikken (talk) 13:53, 10 June 2021 (UTC)[reply]

No, not as explained. You created items with GND ID, where the GND ID was already in WD. HumanAFuser (talk) 14:40, 10 June 2021 (UTC)[reply]

Yes you are right, for the upper cases. I'm not sure why I did't find these cases with open refine and hope this are just a few cases. I did a prior reconcilitation with the GND in open refine though and excluded more than 40 cases from upload. Do you have a special script to check for such cases in my upload, besides this page Property_talk:P227/Duplicates#human? --Richirikken (talk) 15:08, 10 June 2021 (UTC)[reply]

No, I have no special tool. But it seems not many with GND are left for checking. I don't know how OpenRefine works, I would have checked it with SPARQL via the WD query API and maybe a local script or database. HumanAFuser (talk) 15:14, 10 June 2021 (UTC)[reply]

Hi @HumanAFuser:, thank you for your help. Have you checked all cases with a double GND, or are there still open cases? --Richirikken (talk) 20:08, 10 June 2021 (UTC)[reply]

See above "each starting with Q107 remaining in Property_talk:P227/Duplicates#human solved, 18 as listed from GEPRIS merged, two other from other source. HumanAFuser (talk) 16:51, 10 June 2021 (UTC)" - but I have no special GEPRIS query. I am working on cleaning Property_talk:P227/Duplicates#human . HumanAFuser (talk) 20:14, 10 June 2021 (UTC)[reply]