User:Kbseah
Hello world!
Queries
- Labels and descriptions for ciliate classes
- Ciliate species without descriptions in Chinese
- Ciliate genera with the same taxon name (Find homonyms and duplicates)
- Ciliate genera with same taxon name as any other taxa (Find hemihomonyms)
- Ciliate families optionally with reference to first valid descriptions
- Works authored by Wilhelm Foissner where main subject is a taxon
- Eponymous ciliate taxa (list very incomplete)
- Filter ciliate taxa by regex on taxon author citation, optionally get existing taxon-author and year of publication qualifiers https://w.wiki/7HB4
- Ciliate species with French Wikipedia articles
- Ciliate genera without GBIF taxon ID
- Taxa not linked to reference with nomenclatural act but which have a Wikispecies page
- Retrieve all combinations of a given species (whether original combination or not)
- Retrieve 'taxon name' statements where reference is qualified with 'reference has role'
- Retrieve 'taxonomic type' statements where reference is qualified with 'reference has role'
- Articles published in the Journal of Eukaryotic Microbiology with "n. sp." in the title which are referenced in a taxon name statement with a specified role - uses the MWAPI to search labels
- Articles published in IJSEM with "n. sp." in the title with a taxon as a main subject
Use CirrusSearch for searches that would otherwise time out with SPARQL:
- Articles in journals for fields protistology or phycology with main subject "colonialism" - probably mistakenly inferred from keyword "colonial" in the title
- Errata in IJSEM not linked to the article they correct
- Instances of errata published in PNAS whose titles are not in the form expected for errata published in this journal (mostly false positives, but some are real errata).
- Find errata not linked to original articles, extract original titles, and search for matching articles
- As above, but getting journals by publisher and title filter using subquery
- The above but using MWAPI -- case insensitive search
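The MWAPI approach mentioned in the queries above can be sketched roughly as follows. This is only an illustration, not one of the saved queries: the helper name `build_mwapi_title_query` is hypothetical, the journal QID has to be supplied by the caller, and the search string assumes the `haswbstatement` keyword of CirrusSearch together with "published in" (P1433) and "title" (P1476). The function only builds the SPARQL text; it does not run it.

```python
import re


def build_mwapi_title_query(phrase: str, journal_qid: str) -> str:
    """Build SPARQL that lets CirrusSearch (via MWAPI) do the text search.

    `phrase` is searched as a quoted phrase; `journal_qid` restricts hits
    to items with a "published in" (P1433) statement for that journal.
    """
    if not re.fullmatch(r"Q[1-9][0-9]*", journal_qid):
        raise ValueError(f"not a QID: {journal_qid!r}")
    # The search string is embedded in a single-quoted SPARQL literal,
    # so quotes and backslashes in the phrase are rejected, not escaped.
    if re.search(r"""['"\\]""", phrase):
        raise ValueError("phrase must not contain quotes or backslashes")
    search = f'"{phrase}" haswbstatement:P1433={journal_qid}'
    return f"""SELECT ?item ?title WHERE {{
  SERVICE wikibase:mwapi {{
    bd:serviceParam wikibase:endpoint "www.wikidata.org" ;
                    wikibase:api "Search" ;
                    mwapi:srsearch '{search}' .
    ?item wikibase:apiOutputItem mwapi:title .
  }}
  OPTIONAL {{ ?item wdt:P1476 ?title . }}
}}"""
```

The returned string can be pasted into the query service. Because the text matching is done by the search index and not by a SPARQL `FILTER`, it is case insensitive and much less likely to time out.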
Wrongly reconciled/conflated topics in "main subject" statements
I think the wrongly reconciled main subject statements should be deprecated rather than deleted, because they appear to have been added by scripted edits. If they are deleted, they may simply be added again; if they are deprecated, the script author may notice the edit conflict and change their search strategy accordingly.
- "colonial" in articles about biology wrongly reconciled with "colonialism". https://w.wiki/AdEN
- "dark matter" in articles about biology (buzzwords like "microbial dark matter") wrongly reconciled with "dark matter" (concept in physics). https://w.wiki/AdEK
Disambiguation needed
- Tianella - ciliate or diplopod Q39487350
- Hadziella - mollusc or ciliate Q18581905
- Linostomella - fungus or ciliate Q6554754
- Micromitra - brachiopod or ciliate Q16917654
- Trachelochaeta - fish or ciliate Q25361557 - the GBIF record also has the wrong author for the fish https://www.gbif.org/species/4907217
- Urostylinae - description says "insect" but this looks like a bad bot edit Q25361721
- Trichospira - plant or ciliate Q7841011
- Ophionella - plant or ciliate Q6051554
- Cataphractes - ciliate Q121484275 or insects Q5051437
- Panophrys - ciliate or amphibian Q107264714
- Butschliella Q23070191 algae vs. Buetschliella ciliates
- Epalxis - ciliates or snails Q101242432
- Faureia - ciliates or insects Q10494364
- Cercaria
- Cistula
- Hormidium - genus of orchids or genus of green algae, e.g. Q96054281
- Diplotheca - genus of choanoflagellates? But there appears to be some conflation in IRMNG vs WoRMS Q128214737
Disambiguated
- Gonostomum - ciliate Q121305424 or fish Q25361569 (typo for Gonostoma?) - GBIF issue
- Paragonostomum - ciliate Q121305789 or fish Q25361563 - GBIF issue
- Ptychostomum - ciliate Q121345365 or plant Q17283173 -- GBIF and EOL records need some cleaning
- Pelagothrix - ciliate Q121308738 or cyanobacteria Q61946088
- Sagittariidae - ciliate Q121031526 or bird Q2346527
- Lagynidae - ciliate Q121031455 or allogromid Q967963
- Wallackia - fish Q25361554 or ciliate Q121362456 - IRMNG record needs fixing; GBIF issue
- Urceolaria - ciliate or fungus or lichen or plant
- Lepidothrix - ciliate Q121484222 (later homonym??) or bird Q1129268
- Sagittaria venezuelensis - ciliate Q121484281 or plant Q111812714
- Enchelis - ciliate Q25360929 or Ochrophyta Q25409011
- Chilodon - ciliate Q121557928 or mollusc Q107306426
- Urnula - fungus Q4006482 or ciliate Q121597406 or foram Q121597450
- Hainesia - fungus Q10516070 or mollusc Q122226174
- Histrio - ciliate Q122228571 or fish Q1421921
- Membranicola - ciliate Q122231390 or bacteria Q62602627 - GBIF issue
- Craterella - ciliate Q122309538 or forams Q22943677 or sponges Q122309968 or fungi Q122310171
- Favella - ciliate Q94578391 or fossil ostracod Q23890433
- Bartramia - bird Q2789944 or moss Q66623235 or flowering plant Q809586
- Lacrymaria - ciliate Q6468898 or fungus Q1611947
To do
Fix titles of articles with scientific names wrongly processed
- Titles with scientific names that were erroneously removed. This appears to have been a bug in QuickStatementsBot: https://www.wikidata.org/w/index.php?title=Q60458631&oldid=1185683710
- Titles with scientific names not set off by spaces from surrounding words. This also appears to be a bot bug: I think the spaces were removed along with the HTML tags.
- Encoding issues in titles and name strings: https://www.wikidata.org/w/index.php?title=Q54802341&oldid=1833160994
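The suspected cause of the missing spaces (tags stripped together with the whitespace around them) suggests what a safer tag-stripping step could look like. The sketch below is an assumption about how such cleaning might be done, not the bot's actual code, and the function name `strip_tags_keep_spacing` is made up for illustration. It cannot repair titles that are already fused. It also inserts a space wherever a tag sat inside a word (e.g. subscripts in chemical formulae), so results still need review.

```python
import re

TAG = re.compile(r"<[^>]+>")


def strip_tags_keep_spacing(title: str) -> str:
    """Remove HTML tags without fusing the words on either side."""
    # Replace each tag with a space, so "on<i>Paramecium</i>in" cannot
    # collapse into "onParameciumin".
    text = TAG.sub(" ", title)
    # Collapse the runs of whitespace that the step above may create.
    text = re.sub(r"\s+", " ", text).strip()
    # Drop spaces introduced before closing punctuation and after
    # opening brackets.
    text = re.sub(r"\s+([,.;:)\]])", r"\1", text)
    text = re.sub(r"([(\[])\s+", r"\1", text)
    return text
```

For example, `strip_tags_keep_spacing("A new species, <i>Euplotes foo</i>, from soil")` keeps the comma attached to the name while leaving single spaces between words ("Euplotes foo" is a placeholder name, not a real taxon).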
A hierarchy of tasks
- Generate labels and descriptions from statements within an item
- Link items within Wikidata using statements/labels of the items
- Link items in Wikidata to external identifiers in databases that are programmatically accessible
- Add statements to Wikidata based on sources that are not programmatically accessible
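The first and simplest level of this hierarchy, generating a description from statements already on the item, might look like the sketch below. It assumes the taxon rank label and the vernacular name of a higher taxon have already been retrieved, for instance by a query, and formats the result as a QuickStatements V1 description command. The function names are hypothetical.

```python
import re


def taxon_description(rank_label: str, group_label: str) -> str:
    """Compose a basic description such as "species of green algae"."""
    return f"{rank_label.strip()} of {group_label.strip()}"


def quickstatements_description(qid: str, lang: str, text: str) -> str:
    """Format one tab-separated QuickStatements V1 description command."""
    if not re.fullmatch(r"Q[1-9][0-9]*", qid):
        raise ValueError(f"not a QID: {qid!r}")
    if '"' in text or "\t" in text:
        raise ValueError("description must not contain quotes or tabs")
    # "D" + language code sets the description in that language.
    return f'{qid}\tD{lang}\t"{text}"'
```

A caller would first check that the item has no description yet in the target language, since the command does not check this itself.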
Extract structured data semi-manually from published works
- Link Kofoid & Campbell, 1929 taxa to Q122310402
- Genera of ciliates listed in Q95986077
- Protist taxa described by Saville-Kent, see Zoological Record
- Homonyms listed in Corliss 1960 Q103867709
- Genera of microsporidians listed in Q123498442
Data cleaning and linking
- Link taxa to identifiers in GBIF, IRMNG, NCBI, etc., matching higher taxa to avoid homonyms
- Link taxa to publications of their first description
- Find and disambiguate homonyms
- Add basic descriptions for taxa based on vernacular names of higher taxa (e.g. "species of green algae")
- Add vernacular names @zh for taxa from zhwiki sitelinks
- Link errata to the articles they correct by matching titles
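Linking errata by title matching could be prototyped offline along these lines. This is a rough sketch under stated assumptions: erratum titles are assumed to carry a prefix such as "Erratum:" or "Correction to:" followed by the original title, the list of prefixes is illustrative and not exhaustive, and journal-specific forms such as "Correction for Author et al., ..." are not handled. All function names are hypothetical, and the inputs are plain dictionaries mapping QIDs to titles.

```python
import re
import unicodedata

# Prefix word, then either "to"/"for" (optionally with punctuation) or
# punctuation alone; "Correction of errors in ..." is left untouched.
ERRATUM_PREFIX = re.compile(
    r"^\s*(?:erratum|corrigendum|correction|addendum)"
    r"(?:\s+(?:to|for)\b\s*[:.\-]?|\s*[:.\-])\s*",
    re.IGNORECASE,
)


def normalise(title: str) -> str:
    """Casefold, strip diacritics and reduce punctuation to spaces."""
    text = unicodedata.normalize("NFKD", title).casefold()
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    return re.sub(r"[^a-z0-9]+", " ", text).strip()


def original_title(erratum_title: str):
    """Return the title an erratum refers to, or None if none is found."""
    stripped, n = ERRATUM_PREFIX.subn("", erratum_title, count=1)
    stripped = stripped.strip().strip("\"'“”")
    return stripped if n and stripped else None


def match_errata(errata: dict, articles: dict) -> dict:
    """Map each erratum QID to article QIDs with the same normalised title."""
    index = {}
    for qid, title in articles.items():
        index.setdefault(normalise(title), []).append(qid)
    matches = {}
    for qid, title in errata.items():
        orig = original_title(title)
        if orig is None:
            continue
        hits = [a for a in index.get(normalise(orig), []) if a != qid]
        if hits:
            matches[qid] = hits
    return matches
```

Exact matching on normalised titles is deliberately conservative; an erratum with several candidate articles should be resolved by hand, not linked automatically.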
Authors and basionyms for taxa of mosses
- Add taxon author citations to taxa of mosses Q25347 sourced from World Flora Online data export
- Parse taxon author citation and match to botanist author abbreviations, to add taxon author and ex taxon author qualifiers (if not already present)
- Parse taxon author citation to find items that are recombinations but without basionym statements; add basionyms statements sourced from World Flora Online
- Explore parsing abbreviated citations from World Flora Online to match taxa to first valid descriptions or other nomenclatural acts
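The parsing steps above could start from something like the following. It is a simplified sketch of botanical author citations only: parenthetical authors are taken as the basionym authors, and in "A ex B" the part after "ex" is taken as the validly publishing author with A as the "ex" author. It does not handle "in", "emend.", years, or nested parentheses, and the names `AuthorCitation` and `parse_citation` are hypothetical. Matching each abbreviation to a botanist author abbreviation on Wikidata is a separate step not shown here.

```python
import re
from dataclasses import dataclass


@dataclass
class AuthorCitation:
    basionym_authors: list
    basionym_ex_authors: list
    authors: list
    ex_authors: list

    @property
    def is_recombination(self) -> bool:
        # Parenthetical authors imply the name is based on a basionym.
        return bool(self.basionym_authors)


def _split_team(team: str) -> list:
    """Split "Hook. & Taylor" or "A, B et C" into single abbreviations."""
    parts = re.split(r"\s*(?:,|&|\bet\b)\s*", team)
    return [p.strip() for p in parts if p.strip()]


def _split_ex(part: str):
    """Return (validly publishing authors, "ex" authors) for one group."""
    if " ex " in part:
        ex, valid = part.rsplit(" ex ", 1)
        return _split_team(valid), _split_team(ex)
    return _split_team(part), []


def parse_citation(citation: str) -> AuthorCitation:
    m = re.fullmatch(
        r"\s*(?:\((?P<bas>[^()]*)\))?\s*(?P<comb>[^()]*)", citation
    )
    if m is None:
        raise ValueError(f"unsupported citation: {citation!r}")
    bas, bas_ex = _split_ex(m["bas"]) if m["bas"] else ([], [])
    auth, ex = _split_ex(m["comb"].strip())
    return AuthorCitation(bas, bas_ex, auth, ex)
```

An item whose citation parses with `is_recombination` true but which lacks a basionym statement would be a candidate for the third task in the list.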
Data modeling questions
- How to qualify reference where role is type designation?
- How to represent taxon authors as qualifiers of taxon name statement if no item for author exists? (e.g. names we can't disambiguate)
- How to represent order of taxon authors as qualifiers of taxon name statement?
- Is it better to have nomenclatural acts, taxonomic treatments, etc. as objects of "described by" statements? These statements could then themselves be qualified and ranked, for example if the first valid description was found via a third-party citation or if its status is disputed.
KIV
- World Foraminifera Database https://www.marinespecies.org/foraminifera/index.php
- Journal title abbreviations in Zool Record
- What caused this?! https://www.wikidata.org/w/index.php?title=Q18519822&diff=prev&oldid=1753573010
- Study this query
- wat: Item:Q29011578