Canadian ReviewsCanadian Reviews
  • What’s On
  • Reviews
  • Digital World
  • Lifestyle
  • Travel
  • Trending
  • Web Stories
Trending Now

A Horror Flick From The Dog’s POV

Budapest’s Legendary Gellért Thermal Bath Closes Today for Three Years of Renovations, Canada Reviews

Have dietary demands taken the fun out of dinner parties? | Canada Voices

Netflix’s latest Frankenstein trailer is all about the creature Canada reviews

A Stylish but Neat Hemline Where a Raw Cut Was Desired – front mezz junkies, Theater News

MTG’s next original set was inspired by a beloved 1980s ‘weirdo fantasy’ movie

Sony’s updated PS5 Slim launches with less storage in Europe

Facebook X (Twitter) Instagram
  • Privacy
  • Terms
  • Advertise
  • Contact us
Facebook X (Twitter) Instagram Pinterest Vimeo
Canadian ReviewsCanadian Reviews
  • What’s On
  • Reviews
  • Digital World
  • Lifestyle
  • Travel
  • Trending
  • Web Stories
Newsletter
Canadian ReviewsCanadian Reviews
You are at:Home » Wikimedia wants to make it easier for you and AI developers to search through its data Canada reviews
Reviews

Wikimedia wants to make it easier for you and AI developers to search through its data Canada reviews

1 October 20254 Mins Read

The late English writer Douglas Adams is best known as the author of the 1979 book The Hitchhiker’s Guide to the Galaxy. But there is much more to Adams than what is written in his Wikipedia entry. Whether or not you need to know that his birth sign is Pisces or that libraries worldwide store his books under the same string of numbers — 13230702 — you can if you head to an overlooked corner of the Wikimedia Foundation called Wikidata.

There, images, text, keywords, and other information related to Adams are stored both in a webpage and, for the robots among us, in formats designed for machines like JSON.

Now, Wikidata is getting a new AI-friendly database that makes it easier for large language models to ingest the information. The database comes from the Wikipedia Embedding Project out of the German chapter of the Wikimedia Foundation, Wikimedia Deutschland, which oversees Wikidata. The Berlin-based team spent the past year using a large language model to turn the 19 million entries within Wikidata from clunkily structured data into vectors that capture the context and meaning around the Wikidata entry.

In this vectorized format, information is best imagined like a graph with dots and interconnected lines — Adams would be connected to “human” as well as the titles of his books, Lydia Pintscher, Wikidata portfolio lead, told The Verge.

While the front-end user experience will remain the same — no, Wikipedia is not becoming a chatbot, the project leaders say — the back end will become easier for AI developers to access when building, for example, their own chatbots using the data.

The goal of the project is to level the playing field for AI developers outside the monied core of Big Tech, Pintscher said. Companies like OpenAI and Anthropic have the resources to vectorize Wikidata, just like Pintscher and her team did. It’s the smaller outfits that most benefit from the new access to curated data stored in the vaults of Wikidata. “Really, for me, it’s about giving them that edge up and to at least give them a chance, right?” Pintscher said.

She points to Govdirectory as an example project that harnessed Wikidata’s vast data curated by volunteers for good. The platform allows users to find the social media handles and emails for public officials across the world.

Most AI chatbots prioritize popular words and topics across the internet. In addition to giving Little Tech a leg up, the team hopes that easier access to Wikidata will result in AI systems that better reflect niche topics not widely represented across the internet, Pintscher said. This could be a better way to get information into ChatGPT, for instance, than “generating a ton of content and then waiting for the next time for ChatGPT to retrain, and maybe, or maybe not, taking into account what you contributed,” Pintscher said.

In practice, the vectors will allow AI systems to better access the context around information in addition to the information itself, Philippe Saadé, Wikidata AI project manager, told The Verge.

The team used a model from AI company Jina AI to turn Wikidata’s structured data, captured through September 18th, 2024, into vectors. IBM company DataStax currently provides the infrastructure to store the vector database to the project for free.

The team is waiting for feedback from developers who use the database before updating it with information added over the last year. While the current database does not include entirely new information added in the last year, Saadé says small edits or tweaks to existing Wikidata will not diminish the database’s usefulness. “At the end of the day, the vector that we’re computing is like a general idea of an item, so if some small edit has been made on Wikidata, it’s not going to be super relevant,” he said.

Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates.

  • Elissa Welle

    Elissa Welle

    Posts from this author will be added to your daily email digest and your homepage feed.

    See All by Elissa Welle

  • AI

    Posts from this topic will be added to your daily email digest and your homepage feed.

    See All AI

  • News

    Posts from this topic will be added to your daily email digest and your homepage feed.

    See All News

  • Tech

    Posts from this topic will be added to your daily email digest and your homepage feed.

    See All Tech

Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email

Related Articles

Netflix’s latest Frankenstein trailer is all about the creature Canada reviews

Reviews 1 October 2025

A Stylish but Neat Hemline Where a Raw Cut Was Desired – front mezz junkies, Theater News

Reviews 1 October 2025

Samsung, SK Hynix partner with OpenAI to deliver chips for Stargate Canada reviews

Reviews 1 October 2025

The Ray-Ban Meta Gen 2 smart glasses have twice the battery life Canada reviews

Reviews 1 October 2025

Allowing more tailpipe pollution means higher fuel costs for Americans  Canada reviews

Reviews 1 October 2025

Apple asks judge to dismiss Elon Musk’s AI competition lawsuit over OpenAI partnership Canada reviews

Reviews 1 October 2025
Top Articles

The ocean’s ‘sparkly glow’: Here’s where to witness bioluminescence in B.C. 

14 August 2025276 Views

These Ontario employers were just ranked among best in Canada

17 July 2025273 Views

What the research says about Tylenol, pregnancy and autism | Canada Voices

12 September 2025154 Views

Getting a taste of Maori culture in New Zealand’s overlooked Auckland | Canada Voices

12 July 2025140 Views
Demo
Don't Miss
Lifestyle 1 October 2025

MTG’s next original set was inspired by a beloved 1980s ‘weirdo fantasy’ movie

It’s been 18 years since Magic: The Gathering first visited Lorwyn–Shadowmoor, a fantastical setting inspired…

Sony’s updated PS5 Slim launches with less storage in Europe

This Is Officially The Best Hotel In Florence In 2025, According To , Canada Reviews

Lauren Jauregui Joins List of 'Dancing With the Stars’ Most Unforgettable Eliminations

About Us
About Us

Canadian Reviews is your one-stop website for the latest Canadian trends and things to do, follow us now to get the news that matters to you.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

A Horror Flick From The Dog’s POV

Budapest’s Legendary Gellért Thermal Bath Closes Today for Three Years of Renovations, Canada Reviews

Have dietary demands taken the fun out of dinner parties? | Canada Voices

Most Popular

Why You Should Consider Investing with IC Markets

28 April 202424 Views

OANDA Review – Low costs and no deposit requirements

28 April 2024347 Views

LearnToTrade: A Comprehensive Look at the Controversial Trading School

28 April 202449 Views
© 2025 ThemeSphere. Designed by ThemeSphere.
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact us

Type above and press Enter to search. Press Esc to cancel.