Skip to content

Conversation

@LodiAleardo
Copy link

@LodiAleardo LodiAleardo commented Dec 11, 2025

Description

I've removed wrong wikidataID and imported some from wikidata and checked some (I could not check >7000). I think that having wikidataID=null is better than having wrong informations

What I have done

  1. Removed all wikidataID if one of this condition does not meet
    • English names do not match
    • Italian names do not match
  2. Downloaded the possibile wikidataID matching the labels
  3. If no new wikidataID could not be found I've decided to set the field to null

Small notes

I've noticed some errors at least for the Italian json file. I want to point out two of them, maybe in your pipeline you can catch some 🐛

  • In IT.json the entry with id 58239 named Palermiti has some non correct names for "pt-BR" and "pt"
  • In the same file the entity 58240 Palermo is not correctly located (lat, lon are completly wrong)

@dosubot dosubot bot added size:XS This PR changes 0-9 lines, ignoring generated files. fixed Issue has been fixed labels Dec 11, 2025
@dr5hn dr5hn linked an issue Dec 13, 2025 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

fixed Issue has been fixed size:XS This PR changes 0-9 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Duplicate WikidataId for different cities

1 participant