Property talk:P7859

From Wikidata

Documentation


Value doesn't match P214
A value starting with "viaf-" doesn't match the VIAF ID (P214) value.
Violations query: SELECT ?item ?value { ?item wdt:P7859 ?value . OPTIONAL { ?item wdt:P214 ?viaf . BIND( concat("viaf-", ?viaf ) as ?viafvalue) } FILTER NOT EXISTS { ?item wdt:P7859 ?viafvalue } FILTER NOT EXISTS { ?item p:P7859 / ps:P7859 ?viafvalue } FILTER ( regex ( ?value, "^viaf-.*" ) ) } LIMIT 500
List of violations of this constraint: Database reports/Complex constraint violations/P7859#Value doesn't match P214
Value doesn't match P244
A value starting with "lccn-" doesn't match the Library of Congress authority ID (P244) value. Some false positives remain: "-%d" needs to be replaced with "%06d" before comparing.
Violations query: SELECT ?item ?value ?lccn { ?item wdt:P7859 ?value . OPTIONAL { ?item wdt:P244 ?lccn . BIND( concat("lccn-", ?lccn ) as ?lccnvalue ) } FILTER NOT EXISTS { ?item wdt:P7859 ?lccnvalue } FILTER NOT EXISTS { ?item p:P7859 / ps:P7859 ?lccnvalue } FILTER ( regex ( ?value, "^lccn-.*" ) ) } LIMIT 500
List of violations of this constraint: Database reports/Complex constraint violations/P7859#Value doesn't match P244
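The "-%d" to "%06d" replacement noted for the P244 check can be sketched in Python. This is a minimal sketch, assuming the P7859 value carries a hyphenated LCCN while P244 stores the digits after the hyphen zero-padded to six places; the function name and the sample values are made up for illustration.

```python
import re
from typing import Optional


def worldcat_lccn_to_p244(value: str) -> Optional[str]:
    """Convert a hyphenated "lccn-..." value to the zero-padded LCCN form.

    Returns None for values that do not start with "lccn-".
    (Hypothetical helper, not part of any Wikidata tool.)
    """
    prefix = "lccn-"
    if not value.startswith(prefix):
        return None
    lccn = value[len(prefix):]
    # Replace a trailing "-<digits>" with the digits padded to six places,
    # i.e. the "-%d" -> "%06d" rewrite mentioned in the constraint note.
    return re.sub(r"-(\d+)$", lambda m: "%06d" % int(m.group(1)), lccn)


# Made-up sample values, for illustration only.
print(worldcat_lccn_to_p244("lccn-n50-1234"))
print(worldcat_lccn_to_p244("viaf-12345"))
```

Values without a hyphen pass through unchanged, so the helper can be applied to every "lccn-" value before comparing it with P244.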



type constraint


I added a new type constraint for the class event (Q1656682), because conferences such as German Librarians Day Conference 1973 (Q62033327) also have a P7859 (P7859) value. --Mfchris84 (talk) 05:45, 25 March 2020 (UTC)

novalue


Is it valid to use the special "novalue" setting for this property? There are known cases where an item has a VIAF or LCCN (or both) but no WorldCat ID built from those values works. Paweł Ziemian (talk) 21:00, 27 March 2020 (UTC)

negative type constraint

Vladimir Alexiev Jonathan Groß Andy Mabbett Jneubert Sic19 Wikidelo ArthurPSmith PKM Ettorerizza Fuzheado Daniel Mietchen Iwan.Aucamp Epìdosis Sotho Tal Ker Bargioni Carlobia Pablo Busatto Matlin Msuicat Uomovariabile Silva Selva 1-Byte Alessandra.Moi CamelCaseNick Songceci moz AhavaCohen Kolja21 RShigapov Jason.nlw MasterRus21thCentury NGOgo Pierre Tribhou Ahatd JordanTimothyJames Silviafanti Back ache AfricanLibrarian M.roszkowski Rhagfyr 沈澄心 MrBenjo S.v.Mering Hiperterminal (talk) מקף Lovelano Ecravo Chado07 Soufiyouns

Notified participants of WikiProject Authority control, also

@ديفيد عادل وهبة خليل 2, Epìdosis, Salgo60, Animalparty, Jura1, ArthurPSmith:

I noticed that people use P7859 (P7859) for books, where they should use OCLC control number (P243). For example, see the history of Q48837289 (https://www.wikidata.org/w/index.php?title=Q48837289&action=history). A full list of mistakes is below. Any takers to correct these? I fixed the first couple.

155331401	Q48837289
316763896	Q5223184
316942938	Q87140025
432822520	Q48837289
61935747	Q3236166
664802524	Q10316960
78751353	Q48837289
883743583	Q3391127
973542923	Q15595065

P7859 (P7859) already has type constraint (thanks @Mfchris84:). Can someone in the know add a negative type constraint to forbid the classes allowed for OCLC control number (P243)?

I added a note in the description. --Vladimir Alexiev (talk) 14:14, 10 April 2020 (UTC)

initial load and refresh


I finished importing 1683978 WorldCat IDs from VIAF 20191104. I did it with QuickStatements batches (86 batches of 40k statements or 20k identifiers each). It took 2 months and 1 week to process them (from 4 Feb 2020 to 10 Apr 2020). 30326 (1.8%) failed to insert on this initial load because high load causes QS errors (timeouts when adding a statement, after which the corresponding reference also fails).

I also collected 57 errors (exceptions) from WD editor reverts of my inserts. I put them down at https://en.wikipedia.org/wiki/Wikipedia:VIAF/errors#WorldCat_Identities_errors. I will submit these to OCLC for correction.

Now I plan to do a refresh.

  • From WD I got 2072346 VIAF ids
  • From WD I got 1683967 WorldCat ids: 1096206+587684 (lccn+viaf)
  • From VIAF http://viaf.org/viaf/data/viaf-20200302-links.txt.gz (7 GB unzipped) I got 20194854 WorldCat ids: 8521769+11673083+2 (lccn+viaf+other). This is a growth of 275671 (1.36%) in 4 months (compared to 20191104)
  • WD items with VIAF but no WorldCat: 377995
  • Intersect with VIAF-WorldCat and remove exceptions: 57962
  • Submitted 3 QuickStatements batches with the following number of lines
  20001 wd-identities-new-00
  20001 wd-identities-new-01
  17963 wd-identities-new-02

--Vladimir Alexiev (talk) 16:53, 10 April 2020 (UTC)

  • The refresh is pretty much finished. The 3 batches are finishing up: I've done "clear errors" to add the failed references, which may have caused some duplicate statements, see below.
  • I also reported the errors to OCLC, though @Florentyna: is still adding more error reports. Cheers! --Vladimir Alexiev (talk) 09:03, 13 April 2020 (UTC)
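The refresh steps listed in this section (items with VIAF but no WorldCat ID, intersected with the VIAF links, minus exceptions, then split into batches) can be sketched in Python. This is only an illustration under stated assumptions: the function names, the dictionary shapes and the form of the exception list (item and value pairs) are hypothetical, and the thread does not show the actual scripts used.

```python
from typing import Dict, List, Set, Tuple

BATCH_SIZE = 20000  # identifiers per batch, matching the batch sizes quoted above


def refresh_candidates(
    wd_viaf_without_worldcat: Dict[str, str],  # VIAF id -> Wikidata QID (assumed shape)
    viaf_to_worldcat: Dict[str, str],          # VIAF id -> WorldCat id from the links dump (assumed shape)
    exceptions: Set[Tuple[str, str]],          # (QID, WorldCat id) pairs that were reverted (assumed shape)
) -> List[Tuple[str, str]]:
    """Intersect the Wikidata items with the VIAF links and drop known exceptions."""
    rows = []
    for viaf_id, qid in sorted(wd_viaf_without_worldcat.items()):
        worldcat_id = viaf_to_worldcat.get(viaf_id)
        if worldcat_id is None:
            continue  # VIAF has no WorldCat link for this id
        if (qid, worldcat_id) in exceptions:
            continue  # previously reverted, do not re-add
        rows.append((qid, worldcat_id))
    return rows


def split_batches(rows: list, size: int = BATCH_SIZE) -> List[list]:
    """Cut the candidate rows into consecutive slices of at most `size` rows."""
    return [rows[i:i + size] for i in range(0, len(rows), size)]


# Tiny made-up example.
rows = refresh_candidates(
    {"1": "Q1", "2": "Q2", "3": "Q3"},
    {"1": "viaf-1", "2": "lccn-n50-1"},
    {("Q2", "lccn-n50-1")},
)
print(rows)
print([len(batch) for batch in split_batches(list(range(57962)))])
```

Splitting 57962 rows this way gives slices of 20000, 20000 and 17962. The three listed files are each one line longer, which would be consistent with one extra line (for example a header) per file, although the thread does not say so.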

avoiding duplicates


I got a number of reverts due to duplicates, so I'll try to fix them.

There are a number of "legitimate" duplicates due to several VIAF IDs. I wrote a query to find items that have two different statements but with the same value:

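# Items with two different P7859 statements that carry the same value
select ?x ?xLabel ?s1 ?s2 ?v {
  ?x p:P7859 ?s1, ?s2 .
  filter(str(?s1) < str(?s2))   # report each pair once and skip s1 = s2
  ?s1 ps:P7859 ?v .
  ?s2 ps:P7859 ?v .
  # without the label service ?xLabel stays unbound
  service wikibase:label { bd:serviceParam wikibase:language "en" . }
} limit 100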

1. I've created some due to QS race conditions: an item is created, but the reference can't be created because it still doesn't see the new item; then I "Reset errors" on the 3 batches, which creates some duplicates. I won't "reset errors" anymore, so hopefully that'll stop.

2. @Gamaliel: is creating 800 duplicates through adding qualifier "quantity", see https://www.wikidata.org/wiki/Topic:Vkf0s6bt3kzm14j0. So I tried to filter out this reason but the query times out:

select ?x ?xLabel ?s1 ?s2 ?v {
  ?x p:P7859 ?s1, ?s2 .
  filter(str(?s1) < str(?s2))
  ?s1 ps:P7859 ?v .
  ?s2 ps:P7859 ?v .
  # skip pairs where either statement carries a quantity (P1114) qualifier
  filter not exists { ?s1 pq:P1114 ?q1 }
  filter not exists { ?s2 pq:P1114 ?q2 }
} limit 100
  • Will this query include the legitimate duplicates due to multiple VIAF IDs? If I can get a query that includes no false positives I can run a QS batch to eliminate them. Gamaliel (talk) 14:40, 19 April 2020 (UTC)
  • @Gamaliel: This does not return legitimate duplicates: it looks for the same item ?x having the same WorldCat ?v through two different statements. You have to process them in slices of 100, otherwise the query times out. It would be good if you could preserve my references, but that's not crucial because the form of the WorldCat ID shows where it came from (VIAF or LCCN). Cheers! --Vladimir Alexiev (talk) 14:08, 20 April 2020 (UTC)
select ?x ?s1 ?s2 ?v ?q1 ?q2 {
  ?x p:P7859 ?s1,?s2
  filter(str(?s1)<str(?s2))
  ?s1 ps:P7859 ?v.
  ?s2 ps:P7859 ?v.
  optional{?s1 pq:P1114 ?q1}
  optional{?s2 pq:P1114 ?q2}
} limit 100
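To illustrate what the queries in this section match, here is a small Python sketch of the same pairing logic applied to exported statements. It assumes rows of (item, statement id, value), which is a made-up input shape, and the function name is hypothetical. An item with two different values (the "legitimate" case of several VIAF IDs) is not reported, only the same value appearing on two distinct statements.

```python
from collections import defaultdict
from typing import Iterable, List, Tuple


def same_value_pairs(
    statements: Iterable[Tuple[str, str, str]],  # (item, statement id, value), assumed shape
) -> List[Tuple[str, str, str, str]]:
    """Return (item, s1, s2, value) for every pair of distinct statements
    on the same item that carry the same value, with s1 < s2 as in the query."""
    by_item_value = defaultdict(set)
    for item, statement_id, value in statements:
        by_item_value[(item, value)].add(statement_id)

    pairs = []
    for (item, value), ids in sorted(by_item_value.items()):
        ordered = sorted(ids)
        for i, s1 in enumerate(ordered):
            for s2 in ordered[i + 1:]:
                pairs.append((item, s1, s2, value))
    return pairs


# Made-up statements: Q1 has a true duplicate plus a second, different value.
sample = [
    ("Q1", "s1", "viaf-1"),
    ("Q1", "s2", "viaf-1"),
    ("Q1", "s3", "lccn-n50-1"),
    ("Q2", "s4", "viaf-1"),
]
print(same_value_pairs(sample))
```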

The "count" query on top suggests about 19k duplicates (the difference between values and items with value). @Gamaliel: Any progress? --Vladimir Alexiev (talk) 23:29, 30 April 2020 (UTC)

I've been running another, unrelated process in OpenRefine for a couple of days. I didn't think it would take this long, but I'll work on this as soon as that one is finished. Gamaliel (talk) 01:53, 1 May 2020 (UTC)
@Vladimir Alexiev: Just ran a batch of 100 to test it out: https://tools.wmflabs.org/editgroups/b/QSv2T/1589562172614/ If everything looks good I will increase the number of edits with the next batch. Gamaliel (talk) 17:06, 15 May 2020 (UTC)
@Gamaliel: The query now returns just 58 results, so you're close to eliminating all these duplicates. Cheers! --Vladimir Alexiev (talk) 16:57, 24 May 2020 (UTC)

Conflations


Please see Wikidata:Project_chat#Question_about_conflation_in_WorldCat_identities --- Jura 05:38, 3 July 2020 (UTC)

Now: Wikidata:Project chat/Archive/2020/07#Question about conflation in WorldCat identities. --Kolja21 (talk) 05:41, 5 April 2023 (UTC)

outdated WorldCat values


hi! after verifying hundreds of WorldCat values together with thousands of authority control statements, please note:

  • these values change dynamically over time
  • especially values of the form "viaf-foo" may change to "lccn-bar". in many cases the links redirect in approximately 10 sec; sometimes they do not redirect. many values of the form "np-foo" are subject to changes as well.
  • the presence of the "Library of Congress ID" might be a strong indication that an update is required.

please leave a short note if you can (partially) take care of this issue. thanks in advance! kind regards, gangleri aka lery raynhart aka no bias — קיין אומוויסנדיקע פּרעפֿערענצן — keyn umvisndike preferentsn (talk, contribs) 17:37, 24 November 2022 (UTC)
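The observations in this section can be turned into a rough triage rule. The Python sketch below is only one possible reading of them: the function name and the three labels are invented for illustration, and whether a value has really changed still has to be checked against WorldCat itself.

```python
def review_hint(worldcat_value: str, has_loc_id: bool) -> str:
    """Rough triage of a WorldCat value (hypothetical helper and labels).

    - "viaf-" values may change to "lccn-" ones; a Library of Congress ID
      on the same item is treated as a strong hint that an update is due.
    - "np-" values are reported as subject to change as well.
    """
    if worldcat_value.startswith("viaf-"):
        return "update likely" if has_loc_id else "may change"
    if worldcat_value.startswith("np-"):
        return "may change"
    return "no hint"


# Placeholder values in the same style as the note above.
for value, has_loc in [("viaf-foo", True), ("viaf-foo", False), ("np-foo", True), ("lccn-bar", True)]:
    print(value, has_loc, review_hint(value, has_loc))
```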

redirect?


Why not a redirect (to the suggested new WorldCat one) instead of only the deleted property? GiovanniPen (talk) 22:53, 24 October 2024 (UTC)