Property talk:P7859
This property is being considered for deletion. Please share your thoughts on the matter at this property's entry on the Properties for deletion page. |
Documentation
Lua error in Module:Property_documentation at line 1669: Entity not found.
Lua error in Module:Constraints at line 864: attempt to index local 'propertyentity' (a nil value).
This property is being used by: Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.) |
value starting with "viaf-" doesn't match VIAF ID (P214)-value. (Help)
Violations query:
SELECT ?item ?value { ?item wdt:P7859 ?value . OPTIONAL { ?item wdt:P214 ?viaf . BIND( concat("viaf-", ?viaf ) as ?viafvalue) } FILTER NOT EXISTS { ?item wdt:P7859 ?viafvalue } FILTER NOT EXISTS { ?item p:P7859 / ps:P7859 ?viafvalue } FILTER ( regex ( ?value, "^viaf-.*" ) ) } LIMIT 500
List of this constraint violations: Database reports/Complex constraint violations/P7859#Value doesn't match P214
value starting with "lccn-" doesn't match Library of Congress authority ID (P244)-value. Some false positives: need to replace -%d with %06d. (Help)
Violations query:
SELECT ?item ?value ?lccn { ?item wdt:P7859 ?value . OPTIONAL { ?item wdt:P244 ?lccn . BIND( concat("lccn-", ?lccn ) as ?lccnvalue ) } FILTER NOT EXISTS { ?item wdt:P7859 ?lccnvalue } FILTER NOT EXISTS { ?item p:P7859 / ps:P7859 ?lccnvalue } FILTER ( regex ( ?value, "^lccn-.*" ) ) } LIMIT 500
List of this constraint violations: Database reports/Complex constraint violations/P7859#Value doesn't match P244
Conflations
[edit]
type constraint
[edit]I added a new type constraint for the class event (Q1656682) because also conferences like German Librarians Day Conference 1973 (Q62033327) have a P7859 (P7859). --Mfchris84 (talk) 05:45, 25 March 2020 (UTC)
novalue
[edit]Is it valid to use special novalue setting in this property? There are known cases when item has VIAF or LCCN (or both) but any WorldCatId created using those values does not work. Paweł Ziemian (talk) 21:00, 27 March 2020 (UTC)
- There is dedicated category in plwiki "pl:Kategoria:Kontrola autorytatywna bez linku do WorldCat", which groups articles where WorldCatId is manually disabled in a template. Paweł Ziemian (talk) 21:05, 27 March 2020 (UTC)
- @Paweł Ziemian: The allowed values are shown in the format regex above. You are right that some VIAF don't have a WorldCat page of either kind. But I would not import "novalue" claims because new links are added continuously (see last section), and at some point we'd have to remove these claims. Instead, I'd watch for incorrect values and take measures against that. --Vladimir Alexiev (talk) 14:20, 10 April 2020 (UTC)
negative type constraint
[edit]Notified participants of WikiProject Authority control, also
@ديفيد عادل وهبة خليل 2, Epìdosis, Salgo60, Animalparty, Jura1, ArthurPSmith:
I noticed people use P7859 (P7859) for books while they should use OCLC control number (P243). Eg see https://www.wikidata.org/w/index.php?title=Q48837289&action=history, and a full list of mistakes is below. Any takers to correct these? I fixed the first couple.
155331401 Q48837289 316763896 Q5223184 316942938 Q87140025 432822520 Q48837289 61935747 Q3236166 664802524 Q10316960 78751353 Q48837289 883743583 Q3391127 973542923 Q15595065
P7859 (P7859) already has type constraint (thanks @Mfchris84:). Can someone in the know add a negative type constraint to forbid the classes allowed for OCLC control number (P243)?
I added a note in the description --Vladimir Alexiev (talk) 14:14, 10 April 2020 (UTC)
initial load and refresh
[edit]Notified participants of WikiProject Authority control, also
@ديفيد عادل وهبة خليل 2, Epìdosis, Salgo60, Animalparty, Jura1, ArthurPSmith:
I finished importing 1683978 WorldCat ID from VIAF 20191104. I did it with QuickStatements batches (86 batches of 40k statements or 20k identifiers each). It took 2 months and 1 week to process them (from 4 Feb 2020 to 10 Apr 2020). 30326 (1.8%) failed to insert on this initial load because high load causes QS errors (timeouts when adding statements, then the corresponding reference also fails).
I also collected 57 errors (exceptions) from WD editor reverts of my inserts. I put them down at https://en.wikipedia.org/wiki/Wikipedia:VIAF/errors#WorldCat_Identities_errors. I will submit these to OCLC for correction.
Now I plan to do a refresh.
- From WD I got 2072346 VIAF ids
- From WD I got 1683967 WorldCat ids: 1096206+587684 (lccn+viaf)
- From VIAF http://viaf.org/viaf/data/viaf-20200302-links.txt.gz (7Gb unzipped) I got 20194854 WorldCat ids: 8521769+11673083+2 (lccn+viaf+other). This is growth by 275671 (1.36%) in 4 months (compared to 20191104)
- WD items with VIAF but no WorldCat: 377995
- Intersect with VIAF-WorldCat and remove exceptions: 57962
- Submitted 3 QuickStatements batches with the following number of lines
20001 wd-identities-new-00 20001 wd-identities-new-01 17963 wd-identities-new-02
--Vladimir Alexiev (talk) 16:53, 10 April 2020 (UTC)
- The refresh is pretty much finished. The 3 batches are finishing up: I've done "clear errors" to add the failed references, which may have caused some duplicate statements, see below.
- I also reported the errors to OCLC though @Florentyna: is still adding more error reports. Cheers! --Vladimir Alexiev (talk) 09:03, 13 April 2020 (UTC)
avoiding duplicates
[edit]I got a number of reverts due to duplicates, so I'll try to fix them.
There are a number of "legitimate" duplicates due to several VIAF IDs. I wrote a query to find items that have two different statements but with the same value:
select ?x ?xLabel ?s1 ?s2 ?v { ?x p:P7859 ?s1,?s2 filter(str(?s1)<str(?s2)) ?s1 ps:P7859 ?v. ?s2 ps:P7859 ?v. } limit 100
1. I've created some due to QS race conditions: item is created, reference can't be created because it still doesn't see the new item; then I "Reset errors" on the 3 batches, which creates some duplicates. I won't "reset errors" anymore, so hopefully that'll stop
2. @Gamaliel: is creating 800 duplicates through adding qualifier "quantity", see https://www.wikidata.org/wiki/Topic:Vkf0s6bt3kzm14j0. So I tried to filter out this reason but the query times out:
select ?x ?xLabel ?s1 ?s2 ?v { ?x p:P7859 ?s1,?s2 filter(str(?s1)<str(?s2)) ?s1 ps:P7859 ?v. ?s2 ps:P7859 ?v. ?s1 ps:P7859 ?v filter not exists {?s1 pq:P1114 ?q1} ?s2 ps:P7859 ?v filter not exists {?s2 pq:P1114 ?q2} } limit 100
- @Gamaliel: is working to eliminate the duplicates. Out of the first 100 duplicates (query https://w.wiki/Msf), 16 don't have qualifier (I'm fixing those) and the rest have "quantity". --Vladimir Alexiev (talk) 07:53, 15 April 2020 (UTC)
- Will this query include the legitimate duplicates due to multiple VIAF IDs? If I can get a query that includes no false positives I can run a QS batch to eliminate them. Gamaliel (talk) 14:40, 19 April 2020 (UTC)
- @Gamaliel: This does not return legitimate duplicates: it looks for the same item ?x having the same WorldCat ?v through two different statements. You have to process them in slices of 100 else the query times out. It would be good if you could preserve my References but that's not crucial because the form of WorldCat id shows where it came from (VIAF or LCCN). Cheers! --Vladimir Alexiev (talk) 14:08, 20 April 2020 (UTC)
select ?x ?s1 ?s2 ?v ?q1 ?q2 {
?x p:P7859 ?s1,?s2
filter(str(?s1)<str(?s2))
?s1 ps:P7859 ?v.
?s2 ps:P7859 ?v.
optional{?s1 pq:P1114 ?q1}
optional{?s2 pq:P1114 ?q2}
} limit 100
- @Vladimir Alexiev: Thanks! All my queries on whatever topic seem to be timing out today so I'll wait until tomorrow to try this. Gamaliel (talk) 15:26, 20 April 2020 (UTC)
The "count" query on top suggests about 19k duplicates (the difference between values and items with value). @Gamaliel: Any progress? --Vladimir Alexiev (talk) 23:29, 30 April 2020 (UTC)
- I'm been running another unrelated process in Open Refine for a couple of days. I didn't think it would take this long, but I'll work on this as soon as that one is finished. Gamaliel (talk) 01:53, 1 May 2020 (UTC)
- @Vladimir Alexiev: Just ran a batch of 100 to test it out: https://tools.wmflabs.org/editgroups/b/QSv2T/1589562172614/ If everything looks good I will increase the number of edits with the next batch. Gamaliel (talk) 17:06, 15 May 2020 (UTC)
- @Gamaliel: the query now returns just 58 results, so you're close to eliminating all these duplicates. cheers! --Vladimir Alexiev (talk) 16:57, 24 May 2020 (UTC)
Conflations
[edit]Please see Wikidata:Project_chat#Question_about_conflation_in_WorldCat_identities --- Jura 05:38, 3 July 2020 (UTC)
- Now: Wikidata:Project chat/Archive/2020/07#Question about conflation in WorldCat identities. --Kolja21 (talk) 05:41, 5 April 2023 (UTC)
outdated WorldCat values
[edit]hi! by verifying hundred of WorldCat values together with thousend of authority control statements please note:
- these values change dynamically with passing of time
- especially values of the form "viaf-foo" may change to "lccn-bar". in many cases the links redirect in approximately 10 sec. sometimes they do not redirect. many values of th form "np-foo" are as well subject to hanges.
- the presence of the "Library of Congress ID" might be a strong indication that an update is required.
please leave a short note if you can take (partially) care on this issue. thanks in advance! kind regards gangleri aka lery raynhart aka 17:37, 24 November 2022 (UTC) no bias — קיין אומוויסנדיקע פּרעפֿערענצן — keyn umvisndike preferentsn talk contribs no bias — קיין אומוויסנדיקע פּרעפֿערענצן — keyn umvisndike preferentsn talk contribs 17:37, 24 November 2022 (UTC)
redirect?
[edit]why not a redirect (to the suggested new one worldcat bla bla) instead of only the deleted property? GiovanniPen (talk) 22:53, 24 October 2024 (UTC)