This commit is contained in:
Miloslav Ciz 2025-07-18 14:17:08 +02:00
parent cbd9182420
commit 895f96a007
7 changed files with 2032 additions and 1953 deletions

View file

@ -13,6 +13,7 @@ Also remember the worst thing you can do to a joke is put a [disclaimer](disclai
- [C++](cpp.md) - [C++](cpp.md)
- Why doesn't [C++](cpp.md) have [garbage collection](garbage_collection.md)? Because then it would have to collect itself. - Why doesn't [C++](cpp.md) have [garbage collection](garbage_collection.md)? Because then it would have to collect itself.
- Why is the maximum speed called terminal velocity? Because [GUI](gui.md)s are slow. - Why is the maximum speed called terminal velocity? Because [GUI](gui.md)s are slow.
- `body { background-color: salmon; }`
- What's the worst kind of [lag](lag.md)? Gulag. - What's the worst kind of [lag](lag.md)? Gulag.
- Ingame chat: "What's the country in the middle of north Africa?" [{BANNED}](niger.md) - Ingame chat: "What's the country in the middle of north Africa?" [{BANNED}](niger.md)
- What does a [Marxist](marxism.md) call C++? Class struggle. - What does a [Marxist](marxism.md) call C++? Class struggle.
@ -29,7 +30,7 @@ Also remember the worst thing you can do to a joke is put a [disclaimer](disclai
- Do you use [Emacs](emacs.md)? No, I already have a [waifu](waifu.md). - Do you use [Emacs](emacs.md)? No, I already have a [waifu](waifu.md).
- Do you use [Emacs](emacs.md)? No, I already have carpal tunnel. Etc. :D - Do you use [Emacs](emacs.md)? No, I already have carpal tunnel. Etc. :D
- What's the difference between [Elon Musk](elon_musk.md) and a bucket full of [shit](shit.md)? The bucket. - What's the difference between [Elon Musk](elon_musk.md) and a bucket full of [shit](shit.md)? The bucket.
- Why do [Jews](jew.md) have such big noses? Because air is free. - Why do [Jews](jew.md) have such big noses? Because air is [free](gratis.md).
- `alias bitch=sudo` - `alias bitch=sudo`
- What's a trilobyte? 8 trilobits. - What's a trilobyte? 8 trilobits.
- "Never test for a bug that you don't know how to fix." --manager; "If we cannot fix it, it isn't broken." --also manager - "Never test for a bug that you don't know how to fix." --manager; "If we cannot fix it, it isn't broken." --also manager

View file

@ -141,7 +141,7 @@
| [woman](woman.md) | femoid | | [woman](woman.md) | femoid |
| [work](work.md) | slavery | | [work](work.md) | slavery |
| [world wide web](www.md) | world wide wait | | [world wide web](www.md) | world wide wait |
| [YouTube](youtube.md) | JewTube | | [YouTube](youtube.md) | JewTube, ThemTube |
Words you should use often include: abomination, crazy, cretin, [faggot](faggot.md), fat, femoid, idiot, imbecile, insane, landwhale, moron, [nigger](nigger.md), [retard](retard.md), subhuman, ugly. Words you should use often include: abomination, crazy, cretin, [faggot](faggot.md), fat, femoid, idiot, imbecile, insane, landwhale, moron, [nigger](nigger.md), [retard](retard.md), subhuman, ugly.

File diff suppressed because it is too large Load diff

2
usa.md

File diff suppressed because one or more lines are too long

File diff suppressed because one or more lines are too long

View file

@ -3,12 +3,12 @@
This is an autogenerated article holding stats about this wiki. This is an autogenerated article holding stats about this wiki.
- number of articles: 644 - number of articles: 644
- number of commits: 1042 - number of commits: 1043
- total size of all texts in bytes: 5648348 - total size of all texts in bytes: 5650398
- total number of lines of article texts: 40512 - total number of lines of article texts: 40525
- number of script lines: 324 - number of script lines: 324
- occurrences of the word "person": 10 - occurrences of the word "person": 10
- occurrences of the word "nigger": 165 - occurrences of the word "nigger": 166
longest articles: longest articles:
@ -35,37 +35,37 @@ longest articles:
top 50 5+ letter words: top 50 5+ letter words:
- which (3024) - which (3026)
- there (2385) - there (2386)
- people (2250) - people (2250)
- example (1948) - example (1950)
- other (1732) - other (1733)
- about (1540) - about (1540)
- number (1478) - number (1481)
- software (1341) - software (1341)
- because (1275) - because (1275)
- their (1200) - their (1200)
- something (1182) - something (1182)
- would (1157) - would (1158)
- being (1137) - being (1137)
- program (1092) - program (1093)
- language (1041) - language (1041)
- called (1013) - called (1014)
- things (960) - things (961)
- without (941) - without (941)
- simple (908) - simple (908)
- function (888) - function (888)
- numbers (884) - numbers (884)
- computer (880) - computer (881)
- different (863) - different (863)
- world (831) - world (832)
- these (821) - these (821)
- however (815) - however (815)
- programming (814) - programming (814)
- should (794) - should (795)
- still (787) - still (787)
- system (781) - system (781)
- doesn (759) - doesn (760)
- always (746) - always (746)
- drummyfish (744) - drummyfish (744)
- games (743) - games (743)
@ -79,9 +79,9 @@ top 50 5+ letter words:
- using (669) - using (669)
- someone (659) - someone (659)
- course (655) - course (655)
- similar (643) - similar (644)
- actually (640) - actually (641)
- first (630) - first (631)
- value (621) - value (621)
- though (600) - though (600)
- really (599) - really (599)
@ -89,6 +89,20 @@ top 50 5+ letter words:
latest changes: latest changes:
``` ```
Date: Sun Jul 13 17:57:29 2025 +0200
anarch.md
drummyfish.md
encyclopedia.md
human_language.md
information.md
main.md
nigger.md
random_page.md
soyence.md
wiki_pages.md
wiki_stats.md
wikidata.md
woman.md
Date: Thu Jul 10 23:36:36 2025 +0200 Date: Thu Jul 10 23:36:36 2025 +0200
abstraction.md abstraction.md
disease.md disease.md
@ -108,20 +122,6 @@ Date: Tue Jul 8 23:03:00 2025 +0200
lrs_wiki.md lrs_wiki.md
racetrack.md racetrack.md
random_page.md random_page.md
wiki_pages.md
wiki_stats.md
woman.md
Date: Sun Jul 6 09:01:22 2025 +0200
acronym.md
free_speech.md
licar.md
lrs_wiki.md
main.md
nigger.md
random_page.md
rust.md
science.md
wiki_pages.md
``` ```
most wanted pages: most wanted pages:
@ -152,7 +152,7 @@ most popular and lonely pages:
- [lrs](lrs.md) (357) - [lrs](lrs.md) (357)
- [capitalism](capitalism.md) (329) - [capitalism](capitalism.md) (329)
- [bloat](bloat.md) (254) - [bloat](bloat.md) (254)
- [c](c.md) (251) - [c](c.md) (252)
- [free_software](free_software.md) (211) - [free_software](free_software.md) (211)
- [game](game.md) (167) - [game](game.md) (167)
- [suckless](suckless.md) (152) - [suckless](suckless.md) (152)
@ -162,7 +162,7 @@ most popular and lonely pages:
- [computer](computer.md) (131) - [computer](computer.md) (131)
- [kiss](kiss.md) (128) - [kiss](kiss.md) (128)
- [censorship](censorship.md) (128) - [censorship](censorship.md) (128)
- [fun](fun.md) (125) - [fun](fun.md) (126)
- [math](math.md) (124) - [math](math.md) (124)
- [shit](shit.md) (122) - [shit](shit.md) (122)
- [programming](programming.md) (121) - [programming](programming.md) (121)

View file

@ -6,7 +6,7 @@ It should be noted that Wikidata is incredibly useful but a bit unfairly overloo
Wikidata was opened on 30 October 2012. The first data that were stored were links between different language versions of Wikipedia articles, later Wikipedia started to use Wikidata to store information to display in infoboxes in articles and so Wikidata grew and eventually became a database of its own. As of 2022 there is a little over 100 million items, over 1 billion statements and over 20000 active users. The database dump in [json](json.md), [COMPRESSED](compression.md) with gzip, takes gargantuous 130 GB. Wikidata was opened on 30 October 2012. The first data that were stored were links between different language versions of Wikipedia articles, later Wikipedia started to use Wikidata to store information to display in infoboxes in articles and so Wikidata grew and eventually became a database of its own. As of 2022 there is a little over 100 million items, over 1 billion statements and over 20000 active users. The database dump in [json](json.md), [COMPRESSED](compression.md) with gzip, takes gargantuous 130 GB.
The first items added to the database were the [Universe](universe.md), [Earth](earth.md), [life](life.md), [death](death.md), [human](people.md) etc. Some cool items include [nigger](nigger.md) (Q1455718), fart (Q5436447), [LMAO](lmao.md) (Q103319444), [Anarch](anarch.md) (Q114540914) and [this very wiki](lrs_wiki.md) (Q116266837). The structure of the database actually suggests that apart from the obvious usefulness of the data itself we may also toy around with this stuff in other [fun](fun.md) ways, for example we can use wikidata to give a hint of significance of any thing or concept -- given that two similar things predate wikidata itself, we may assume that the one with lower number is likely more significant for having been added earlier. For instance a [dog](dog.md)'s serial number is 144 and [cat](cat.md)'s is 146, so a dog would "win" this kind of internet battle by a tiny margin. Alternatively we can compare the size of the items' records to decide which one wins in significance. Here dog wins again with 200 kilobytes versus cat's 196 kilobytes. The first items added to the database were the [Universe](universe.md), [Earth](earth.md), [life](life.md), [death](death.md), [human](people.md) etc. Some cool items include [nigger](nigger.md) (Q1455718), fuck her right in the pussy (Q105676108), fart (Q5436447), [LMAO](lmao.md) (Q103319444), [Anarch](anarch.md) (Q114540914) and [this very wiki](lrs_wiki.md) (Q116266837). The structure of the database actually suggests that apart from the obvious usefulness of the data itself we may also toy around with this stuff in other [fun](fun.md) ways, for example we can use wikidata to give a hint of significance of any thing or concept -- given that two similar things predate wikidata itself, we may assume that the one with lower number is likely more significant for having been added earlier. For instance a [dog](dog.md)'s serial number is 144 and [cat](cat.md)'s is 146, so a dog would "win" this kind of internet battle by a tiny margin. Alternatively we can compare the size of the items' records to decide which one wins in significance. Here dog wins again with 200 kilobytes versus cat's 196 kilobytes.
## Database Structure ## Database Structure
@ -21,38 +21,116 @@ The most important properties are probably **instance of** (P31) and **subclass
## How To ## How To
There are many [libraries](library.md)/[APIs](api.md) for wikidata you can use, unlike shitty corporations that guard their data by force wikidata provides data in friendly ways -- you can even download the whole wikidata database in [JSON](json.md) format (about 100 GB). Many [libraries](library.md)/[APIs](api.md)/tools exist for accessing wikidata because, unlike shitty [corporations](corporation.md) who guard and obfuscate their data by force, wikidata provides data in friendly ways -- you can even download the whole database dump in several formats including simple ones such as [JSON](json.md) (about 100 GB).
The easiest way to retrieve just the data you are interested in is probably going to the online query interface (https://query.wikidata.org/), entering a query (in [SPARQL](sparql.md) language, similar to [SQL](sql.md)) and then clicking download data -- you can choose several formats, e.g. [JSON](json.md), [CSV](csv.md) etc. That can then be processed further with whatever language or tool, be it [Python](python.md), [LibreOffice](libreoffice.md) Calc etc. Arguably the easiest way to grab some smaller data is through the online query interface (https://query.wikidata.org/), entering a query (in [SPARQL](sparql.md) language, similar to [SQL](sql.md)) and then clicking download data -- you can choose several formats, e.g. [JSON](json.md) or [CSV](csv.md). That can then be processed further with whatever language or tool, be it [Python](python.md), [LibreOffice](libreoffice.md) Calc etc.
**BEWARE**: the query you enter may easily take a long time to execute and time out, you need to write it nicely which for more complex queries may be difficult if you're not familiar with SPARQL. However wikidata offers online tips on [optimization](optimization.md) of queries and there are many examples right in the online interface which you can just modify to suit you. **BEWARE**: the query you enter may easily take a long time to execute and time out, you need to write it nicely which for more complex queries may be difficult if you're not familiar with SPARQL. However wikidata offers online tips on [optimization](optimization.md) of queries and there are many examples right in the online interface which you can just modify to suit you. Putting a limit on the number of results usually helps, also try to reorder the conditions and so on.
Here are some example of possible queries. The following one selects video [games](game.md) of the [FPS](fps.md) genre: Now finally on to a few actual examples. The first one will show one of the most basic and common queries: just listing items with certain properties, specifically video [games](game.md) of the [FPS](fps.md) genre here:
``` ```
SELECT ?item ?itemLabel WHERE SELECT ?item ?itemLabel ?itemDescription WHERE
{ {
?item wdt:P31 wd:Q7889. # item is video game and ?item wdt:P31 wd:Q7889. # item is a video game and
?item wdt:P136 wd:Q185029. # item is FPS ?item wdt:P136 wd:Q185029. # item is FPS
# this gets the item label: # this gets item labels (you can append "Label" or "Description" to any requested variable now):
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". } SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
} }
LIMIT 100 # limit to 100 results, make the query faster LIMIT 100 # limit to 100 results, make the query faster
``` ```
Another query may be this one: select [black holes](black_hole.md) along with their mass (where known): The language is somewhat intuitive, you basically enter conditions and the database then searches for records that satisfy them, but if it looks hard just see some tutorial.
OK, how about some lulz now? Let's search for human [races](race.md), then count them and compute their average, minimum and maximum height:
``` ```
SELECT ?item ?itemLabel ?mass WHERE SELECT ?race ?raceLabel ?raceDescription (COUNT(?human) AS ?count) (AVG(?height) AS ?averageHeight) (MAX(?height) AS ?maxHeight) (MIN(?height) AS ?minHeight) WHERE
{ {
{ ?item wdt:P31 wd:Q589. } # instances of black hole { # subquery for optimization (delaying label retrieval)
UNION SELECT ?human ?race ?height WHERE
{ ?item wdt:P31 ?class. # instance of black hole subclass (e.g. supermassive blackhole, ...) {
?class wdt:P279 wd:Q589. } ?human wdt:P31 wd:Q5. # is human
?human wdt:P172 ?race. # has race
OPTIONAL { ?item wdt:P2067 ?mass } # has height in centimetres:
?human p:P2048 ?st1.
?st1 psv:P2048 ?vn1.
?vn1 wikibase:quantityAmount ?height.
?vn1 wikibase:quantityUnit wd:Q174728.
} LIMIT 10000
}
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". } SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],mul,en". }
} GROUP BY ?race ?raceLabel ?raceDescription ORDER BY DESC(?count)
```
Current this returned 331 races, the most frequent (in the database) being "[African American](nigger.md)" with average height 181 cm, then White Americans (171 cm), White People (167 cm) etc. Now let's shit on [privacy](privacy.md) and make an [NSA](nsa.md) style database or people along with personal data such as their names, birth and death dates, causes of death etc.:
```
SELECT ?human ?humanLabel ?humanDescription ?sexLabel ?birthDate ?birthPlaceLabel ?deathDate ?deathCauseLabel
WITH
{
SELECT ?human ?birthDate ?birthPlace ?sex ?deathDate ?deathCause WHERE
{
?human wdt:P31 wd:Q5.
?human wdt:P569 ?birthDate.
?human wdt:P19 ?birthPlace.
?human wdt:P21 ?sex.
OPTIONAL { ?human wdt:P570 ?deathDate. }
OPTIONAL { ?human wdt:P509 ?deathCause. }
FILTER (?birthDate >= "1000-01-01T00:00:00Z"^^xsd:dateTime)
} LIMIT 10000
} AS %data
WHERE
{
INCLUDE %data
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],mul,en" .
}
} }
``` ```
Cool, works pretty nicely. Another [interesting](interesting.md) query may be one about [languages](human_language.md), counting their grammatical cases, tenses etc.:
```
SELECT ?languageLabel ?nativeName ?typeLabel ?countryLabel ?writingLabel ?code1 ?code2 ?speakers ?cases ?tenses ?genders WHERE
{
{
SELECT
?language
(MAX(?nn) AS ?nativeName)
(MAX(?ws) AS ?writing)
(MAX(?sp) AS ?speakers)
(MAX(?c1) AS ?code1)
(MAX(?c2) AS ?code2)
(MAX(?ty) AS ?type)
(MAX(?co) AS ?country)
?cases
?tenses
?genders
WHERE
{
{ ?language wdt:P31 wd:Q33742. } UNION { ?language wdt:P31 wd:Q20162172. } UNION { ?language wdt:P31 wd:Q33215. } # is one of these
OPTIONAL{?language wdt:P1098 ?sp. }
OPTIONAL{?language wdt:P1705 ?nn.}
OPTIONAL{?language wdt:P282 ?ws.}
OPTIONAL{?language wdt:P218 ?c1.}
OPTIONAL{?language wdt:P219 ?c2.}
OPTIONAL{?language wdt:P279 ?ty.}
OPTIONAL{?language wdt:P2341 ?co.}
OPTIONAL{ SELECT ?language (COUNT(?tmp) AS ?cases) WHERE { ?language wdt:P2989 ?tmp. } GROUP BY ?language }
OPTIONAL{ SELECT ?language (COUNT(?tmp) AS ?tenses) WHERE { ?language wdt:P3103 ?tmp. } GROUP BY ?language }
OPTIONAL{ SELECT ?language (COUNT(?tmp) AS ?genders) WHERE { ?language wdt:P5109 ?tmp. } GROUP BY ?language }
} GROUP BY ?language ?cases ?tenses ?genders ?countries
}
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],mul,en". } # here assign label
} ORDER BY ?speakers
```
This currently returns 1309 languages, French with most tenses (21) and Hungarian with most cases (24).
## See Also
- [encyclopedia](encyclopedia.md)
- [database](database.md)