This is an archive of past discussions. Do not edit the contents of this page.
If you wish to start a new discussion or revive an old one, please do so on the current talk page.
Abstract Wikipedia should use a different mw-wiki-logo than Wikipedia for Vector 2010 skin
Hello friends. I've been asked to start a community conversation about this. I'd like to propose that Abstract Wikipedia create its own logo, so that folks visiting this wiki don't get it mixed up with Wikipedia. Even though this project is under the Wikipedia domain, I think it's pretty unique and it'd make sense to make sure it doesn't get mixed up with enwiki or other Wikipedias by newbies googling for Wikipedia articles. Thoughts? Thanks. Novem Linguae (talk) 12:02, 1 May 2026 (UTC)
Latest comment: 1 month ago1 comment1 person in discussion
The Wikidata tool Reasonator already automatically creates several sentences in English describing human Wikidata items using auto_long_desc.js under a GPL 2.0 free software license without using any large language model or other AI.
Latest comment: 1 month ago7 comments4 people in discussion
Might be a stupid question but what articles are/will be allowed on Abstract Wikipedia? How far is it meant to expand? The language Wikipedias have some differing policies, so might not be as simple as copying those. Personally I find the idea of creating an article for any and every Wikidata item really cool and a good baseline for what can have an article, but wouldn't ~120 million abstract articles become unwieldy? If the only requirement is that the article topic has a Wikidata item, then there are many interesting possibilities; one could write about individual dates, Wikidata test items, even Wikimedia disambiguation pages. Have not found where this is explained, if anywhere. This may be up to common sense, but trouble is, couldn't one create an encyclopedically meaningful article on just about anything? Pretty important policy not to have if it's undecided, though is not a problem at the moment. Just wondering. Some helpful person (talk) 23:36, 1 May 2026 (UTC)
Ah, I forgot about that page. So it is meant to abstract information from existing Wikipedia articles? Makes sense, though it still raises the question of creating new articles. Some helpful person (talk) 00:41, 2 May 2026 (UTC)
At this juncture, I don't think anyone has in mind drafting anything new here as such: this is just responding to existing Wikidata items and the possibility of new Wikipedia articles drafted from them, using Wikifunctions. Koavf (talk) 00:51, 2 May 2026 (UTC)
I don't think we ever get any article an existing wiki does not have. Even if abstract content creation gets as fast as reasonably possible, it will never beat out typing text, so we will never catch up with, say, enwiki or eswiki. Feeglgeef (talk) 01:10, 2 May 2026 (UTC)
It's also a question of policy. There are topics about which things could be said, but which wouldn't meet the baseline notability or other criteria of the big monolingual wikis. Arlo Barnes (talk) 20:22, 2 May 2026 (UTC)
This is true, if we keep d:'s notability policy (which seems like the most natural one) then there are many subjects that we theoretically can talk about that don't meet the GNG and similar guidelines. Feeglgeef (talk) 22:25, 2 May 2026 (UTC)
How many articles in how many languages are actually fully available without errors
The most important thing to count is already listed there: How many articles in how many languages are actually fully available without errors? It should be not just a count, but a list: I want to see which articles are readable in which languages; which articles are readable in some languages, but not others (which ones?); which articles are not readable in any language; etc.
An even more relevant, but much harder to measure thing is how many of those articles are actually more useful than not having an article at all. There is no article about Boston in many languages, but Q100, which currently says "Boston is the capital of Massachusetts.Boston is the largest city of Massachusetts." is not significantly more useful than nothing at all even if it's fully rendered in another language in which there is no concrete article about it. Amir E. Aharoni (talk) 12:12, 2 May 2026 (UTC)
The status update is about two projects and so I can understand if there are comments here. What do you think about creating a status update overview page with links to the status updates in Abstract Wikipedia to offer the possibility to discuss the status updates and especially things related to Abstract Wikipedia there. Hogü-456 (talk) 18:57, 4 May 2026 (UTC)
I don't think that's necessary. You can just include your abstractwiki (as in, the live wiki, not the project) related comments on the page on f:, or, if for some reason that's not possible, as a reply to the newsletter announcement here. The point is, the project chat is crowded enough with topics as it is, ideally it should be more focussed on general matters of discussion than close-ended enquiries about topics that have their own talk page. I find Amire80's use of a new == heading here to be improper. Feeglgeef (talk) 19:32, 4 May 2026 (UTC)
Wikifunctions & Abstract Wikipedia Newsletter #246 is out: Request for input: what should we count for Abstract Wikipedia
Latest comment: 1 month ago1 comment1 person in discussion
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we ask you what would be the relevant metrics for Abstract Wikipedia, we discuss our latest news on Composition Language v2, and we take a look at the latest software developments.
Want to catch up with the previous updates? Check our archive!
Again, this should definitely have been on the talk page for that status update. This isn't the everything page, it should be used for discussion relating to the wiki. Feeglgeef (talk) 18:26, 2 May 2026 (UTC)
It's not a question just about the status update, even if it was prompted by it. It's a question about Abstract Wikipedia that can be relevant beyond the status update.
Stop policing how people use discussion pages without a particularly good reason. If you don't have an answer to the actual question, don't write anything. Amir E. Aharoni (talk) 19:38, 2 May 2026 (UTC)
I have to agree with Feeglgeef that this would be better raised on the status update's talk page itself. —rae5e<talk>19:42, 2 May 2026 (UTC)
I'm "policing" which discussion pages are being used for the same reason different discussion spaces exist in the first place. And your distinction between "just about" and "prompted by" is irrelevant. If I want to ask someone how they did something, I would ask them first, not go to a general discussion board. This is still the project chat, not a Q&A zone. @DVrandecic (WMF), can you please answer the question? Feeglgeef (talk) 22:22, 2 May 2026 (UTC)
I have a local script that I have run on a local copy. Nothing more complicated than that. Could also be very wrong -- the script hasn't been peer-reviewed or anything. --DVrandecic (WMF) (talk) 08:54, 4 May 2026 (UTC)
Latest comment: 1 month ago7 comments4 people in discussion
I was trying out a few functions recently, such as the new superlative function, and I noticed that every adjective has to be a Wikidata item. For example, I couldn't type in the inputs "Bugatti Veyron", "fast", "Earth" because "fast" as an adjective is not a Wikidata item. This doesn't really make sense, given that Wiktionary already has the word for "fast" and its superlative form in many different languages. Would it be possible to integrate Wiktionary into the function, so the user can type an adjective or verb from Wiktionary instead of having to deal with Wikidata (which consists almost entirely of nouns)? Somepinkdude (talk) 16:49, 6 May 2026 (UTC)
No, sadly. I would try "speed" as the adjective, though, you have to use abstract qualities instead of concrete English terms. Here's an example of the word being used.Feeglgeef (talk) 18:02, 6 May 2026 (UTC)
I tried this, and it gave "Bugatti Veyron is the speedest car on Earth", which is laughably bad. The problem is that Wikidata does not "know" what the superlative form is, while Wiktionary does. Speed is a fairly common descriptor, but what about less common adjectives which aren't on Wikidata? Will there need to be a bot to copy all of this information from Wiktionary? Somepinkdude (talk) 22:14, 6 May 2026 (UTC)
Well, no, the superlative form is chosen by f:Z12203. I think there's a bug with the English implementation that calls that function wrong. Feeglgeef (talk) 22:19, 6 May 2026 (UTC)
@Somepinkdude I did find the item fast (Q19807466) which was linked to a concept database's entry for "fast"; I have at least linked it to the English lexeme "fast" and will try to add more links at a better hour. Mahir256 (talk) 04:18, 7 May 2026 (UTC)
Unfortunately not, and, honestly, I wouldn't recommend article creation at the moment, it's likely to need significant refactoring as the way we represent abstract content changes, but, I can point you to Q668 as a reasonably good example at the moment. Feeglgeef (talk) 02:39, 8 May 2026 (UTC)
308 is the number of concrete Wikipedias in which a manually-written article about India is available.
The abstract article renders for me in English as "India is a country in Asia. India is a republic. New Delhi is the capital of India. India is the most populous country in world." Switching to another language to get rendered as an abstract article is done by typing the language's name in the box above the rendered text. I tried French, German, and Dutch, and got errors in all of them. Amir E. Aharoni (talk) 03:12, 8 May 2026 (UTC)
So it goes. The paragraph has quite a lot of distinct sentence generation functions, all of which of course have to be implemented in the target language. This will be rounded out once we're able to more effectively generalize linguistic content. I view these functions as sort of patchy temporary solutions to the sentence generation problem while we figure out something more robust, which I hope is soon to come. —rae5e<talk>04:15, 8 May 2026 (UTC)
Wikifunctions & Abstract Wikipedia Newsletter #247 is out: References from Wikidata now available
Latest comment: 1 month ago1 comment1 person in discussion
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we announce that is now possible to pass references in Wikidata statements, we introduce the Abstract Data dashboard, we report you on the presentation about Abstract Wikipedia at WikiCon Australia, and we take a look at the latest software developments.
Want to catch up with the previous updates? Check our archive!
Latest comment: 1 month ago4 comments2 people in discussion
The use of f:Z33068 (en: paragraph from sentences) always most of the time results in the error "Reached time limit in orchestrator." So should it be used at all for now, temporarily? Or maybe there are already some tickets about this that I am not aware of? -- Asked42 (talk) 18:23, 10 May 2026 (UTC)
It previously worked, so I'm hoping whatever the issue is will be fixed soon. The current state of the wiki makes immediatism an illogical philosophy to have at the moment, in my opinion. Feeglgeef (talk) 20:16, 10 May 2026 (UTC)
This issue seems to no longer exist, you might need to clear the cache by making a dummy edit on affected articles. I implemented the function in Python (instead of composition), which has increased the speed by about 2000 ms. Feeglgeef (talk) 04:31, 11 May 2026 (UTC)
I've added some of them. Thank you for your list of best practices by the way, I generally agree with them (I can't endorse connecting WD items currently in hope of a better technical solution, but otherwise I do). Feeglgeef (talk) 00:58, 15 May 2026 (UTC)
Proposals on the architecture of Abstract Content rendering
Latest comment: 1 month ago1 comment1 person in discussion
Starting from a discussion born on the Wikifunctions Telegram chat, I've explained two different proposals on how the NLG on Abstract Wikipedia should be organized in the page User:Dv103/Abstract articles architectures. Please come to contribute to the discussion, or to propose alternatives. Dv103 (talk) 14:32, 11 May 2026 (UTC)
Latest comment: 1 month ago3 comments2 people in discussion
The copyright message has a few issues, mainly that it does not address abstract content (which is not "text"), nor object labels (which are under CC0). Proposed text:
"
Text and abstract content are available under the [https://creativecommons.org/licenses/by-sa/4.0/ Creative Commons Attribution-ShareAlike License]; the labels of objects from [[f:|Wikifunctions]] are available under the [https://creativecommons.org/publicdomain/zero/1.0/ Creative Commons CC0 License]; additional terms may apply. See [https://foundation.wikimedia.org/wiki/Special:MyLanguage/Policy:Terms_of_Use Terms of Use] for details.
"
@Feeglgeef: As you probably imagine, this text is very tightly controlled by the Wikimedia Foundation's Legal department. I think the first port of call should be a discussion about why you might want to change things, rather than a rush to specific re-wordings, before approaching them for sign-off. Jdforrester (WMF) (talk) 14:28, 12 May 2026 (UTC)
Well I briefly summarized my justification, my proposed text was just an idea for a starting point. Basically, the abstract content is not really "text", so the footer doesn't actually tell the reader what they're allowed to do with it, and I argue that the current wording implies that the labels are licensed under CC-BY-SA 4, when they aren't. I think a change is critical as otherwise we're not showing the reader a license, or, arguably, in the case of labels, showing users a license that does not apply, which is a pretty bad thing. Feeglgeef (talk) 14:51, 12 May 2026 (UTC)
Latest comment: 1 month ago8 comments4 people in discussion
https://abstract-data.toolforge.org/ is a very useful tool, but it doesn't seem to have anything pointing to the location of its source code. Is it closed-source or unfree?
While I'm at it I feel I should suggest having language pages' paths formatted differently from how they are currently—at the moment it's /languages/Toki%20Pona when something like /languages/tok would be more suitable. —rae5e<talk>
Thank you. I think a link to that repository should be put in a footer somewhere on the website as I wouldn't have found that if not for you. —rae5e<talk>02:30, 13 May 2026 (UTC)
Thanks for the feedback and the report, fixed both things, we have now the link to the repo in the side bar footer, and also the language codes are used in the URLs. Thanks! DSantamaria-WMF (talk) 11:46, 13 May 2026 (UTC)
I think the use of flags is not really harmful since this is not a language selector, it is just a list of languages. Of course not every language has an associated flag, for example Latin, which does look a bit awkward, but it can't be helped. Maybe give languages without a flag a unique icon from Wikimedia Commons? e.g.
I thought I saw 🏛️ being used for Latin earlier. But of course, as far as the Unicode Consortium is concerned, 🇬🇧 doesn't mean 'English', no more than 🏳️🌈 means 'Polari'; technically, it doesn't even mean 'United Kingdom', just the letters 🇺 and 🇰 next to each other (which linguistically would indicate Ukrainian in ISO 639, but these are 'regional indicator symbols'). Arlo Barnes (talk) 17:21, 14 May 2026 (UTC)
As seems (from this comment and also others' comments) that flags are a little bit controversial, I just removed them, IMHO it was just an aesthetic concept, I like the look and feel that flags bring to the language labels, but happy to remove them if make some people feel uncomfortable with the concept. DSantamaria-WMF (talk) 05:54, 15 May 2026 (UTC)
Latest comment: 1 month ago2 comments2 people in discussion
Make it more user-friendly by adding drag-and-drop compenots and one click additions, and make it similar to a visual macro buidler. ChippyTechGH (talk) 01:00, 15 May 2026 (UTC)
Latest comment: 1 month ago6 comments2 people in discussion
Hi all, I've just created infobox for person (Z35167). All it can really do presently is display the name of the person in the chosen language and in the person's native language, but it's a good start. I started writing it as a composition but chose to switch to writing it in JavaScript instead, which ended up being less headache-inducing (managing a large list of strings to concatenate together is not fun, plus compositions bring lots of additional overhead). Unfortunately, this means that the actual useful parts of the infobox—the information about the person themself—is not really doable. I'd need to dereference the Wikidata property references to get their label in the requested language, but since they're references and I'm working with code, not compositions, I can't just call fetch Wikidata property (Z6822) and be done with it. If there's a WikiLambda API to do this inside of the JS itself that would be great and would solve everything, but at the moment that alone is what's holding it back, or so it seems. I can get the actual values of statements just fine, though. If I am missing some amazing WikiLambda API function that will solve this problem for me by letting me dereference the PID, then please let me know because that would be great.
If you want to look at an example of the infobox right meow, you can view Q317521 in Toki Pona and see how it states Musk's name in both Toki Pona and his native English name. COOL!
My hope is that this can be made less barebones in the near future. Infoboxes are a great way to get quick information on a subject and can fill in the informational gaps in the presently sparse articles we have at the moment, all while staying in the user's requested language. Feedback is appreciated! —rae5e<talk>02:58, 15 May 2026 (UTC)
Thank you for this! I wonder if perhaps it would be better to create one "infobox" function and then have compositions call that, making wd retrival possible? Feeglgeef (talk) 03:38, 15 May 2026 (UTC)
Working on that at the moment, I think I've got it working but I'm hitting a ratelimit in the orchestrator so I don't know for sure. The main issue with it now is that the actual infobox template (Z35175) wrapper was programmed in JS so as to not be a pain in the ass, so this means that it can only take in strings, HTML fragments, etc. so the retrieval of Wikidata stuff still needs to be done by the caller and is rather verbose (see the main implementation). The niceties will come soon later once I know that it all works at a lower level, though. —rae5e<talk>05:04, 15 May 2026 (UTC)
So the problem now seems to be that when label of Wikidata property in language (Z29825) isn't being faulty, it's being incredibly slow. Considering retrieving Wikidata items from their references seems to take, like, four to five seconds, and we'll have tens of statements in these infoboxes, I don't see how the property names could be retrieved in a way that doesn't cause the orchestrator to time out or hit a ratelimit. —rae5e<talk>06:01, 15 May 2026 (UTC)
It works now, a little rough around the edges, also very slow, but I think I'm mostly happy with it. Of course it would be great if all you needed to provide to the template function to get a label-statement row was the property reference itself, but I'm too sleep-deprived to figure that out at the moment. I made five bespoke functions specifically for this infobox, more than I would like, but it's probably not that bad. I'm just hoping I didn't duplicate any functionality in trying to make the composition less verbose and repetitive. —rae5e<talk>07:15, 15 May 2026 (UTC)
Wikifunctions & Abstract Wikipedia Newsletter #248 is out: A higher meaning
Latest comment: 1 month ago1 comment1 person in discussion
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we discuss functions creating language fragments, we present our latest news in Types, and we take a look at the latest software developments.
Want to catch up with the previous updates? Check our archive!