Fri, Nov 8
Progress update on the hypothesis for the week
Thu, Nov 7
I can confirm that this is the expected behavior.
Wed, Nov 6
@XiaoXiao-WMF this task is high priority for SDS 1.2.3, please let me know how to proceed.
Tue, Nov 5
@Samwalton9 and probably @KCVelaga_WMF can comment on this.
My understanding is that we need to work on these two things in parallel: the first is to avoid blocking the work on SDS 1.2.3, and the second is to fully accomplish the goal of this project.
@fkaelin can provide more details.
Fri, Nov 1
Progress update on the hypothesis for the week
Oct 18 2024
Weekly update:
Oct 16 2024
Thanks @achou , and also to @Aitolkyn and @MunizaA , you all did amazing work on making this model faster! The speedup is really impressive, and you used cutting-edge methods to make this possible. This improvement makes a huge difference from the final user's perspective, and especially for the WME use case.
Oct 14 2024
@Miriam pls confirm or update the parent task.
Hi @leila , thanks for your words.
I'm optimistic about the project finishing on time; as you said, we have a great team. I just highlighted the time constraints to explain why we are focusing on offline data this quarter. However, depending on how these definitions are going to be used in the future, it would be interesting to think about how they could work with live data (like real-time monitoring), but for now this is out of scope for this quarter.
Oct 11 2024
Progress update on the hypothesis for the week
- We have defined the list of participants in this project. Apart from myself, this includes 3 people from Research (Isaac, Pablo and Yu-Ming), 2 from Design Research (Claudia and Eli), one from the Moderation Tools team (Sam), one from Product Analytics (KC) and one from Product Design (Olga T.)
- Given the size of the team, we decided to split the work into two branches: a qualitative piece led by Claudia, and a quantitative one led by Isaac.
- Together with the KR owner (Leila), the hypothesis was defined as: If we combine existing knowledge about moderators with quantitative methods for detecting moderation activity, we can systematically define and identify Wikipedia moderators.
Oct 9 2024
Oct 7 2024
Oct 4 2024
Oct 2 2024
Confirm whether the hypothesis was supported or contradicted
Sep 13 2024
Progress update
- I’m working on building a set of keywords related to peacock behavior and promotional tone. To do this, I’m using a TF-IDF approach, a well-known method to identify terms (keywords) that characterize a set of documents.
- This week and next are short for me (I'm taking several days off), so it might take a bit more time to finalize this.
- I also communicated to my manager that there might be the possibility of building a product based on the fine-tuned model. In case we decide to move forward, we would need to coordinate with her and the other teams involved on how to proceed.
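The TF-IDF step described above can be sketched in pure Python. This is a minimal illustration of the idea (ranking terms that characterize a set of documents), not the actual pipeline, which presumably uses a proper tokenizer and the tagged article corpus:

```python
import math
from collections import Counter

def tfidf_keywords(docs, top_k=5):
    """Rank terms by accumulated TF-IDF across a set of documents.

    docs is a list of strings; returns the top_k highest-scoring terms.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in tokenized for term in set(doc))
    scores = Counter()
    for doc in tokenized:
        tf = Counter(doc)
        for term, count in tf.items():
            idf = math.log(n_docs / df[term]) + 1.0  # smoothed IDF
            scores[term] += (count / len(doc)) * idf
    return [term for term, _ in scores.most_common(top_k)]
```

A term that is frequent within the tagged set but rare across it (here, a hypothetical peacock word) floats to the top, which is what makes the method useful for building a keyword list.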
Sep 7 2024
Progress update
- Experiments:
- As planned, I studied the ability of the model fine-tuned to detect peacock behavior to detect other promotion-related content issues, described in this data set.
- I ran the model on 4 other datasets: {{fanpov}}, {{advert}}, {{autobiography}}, {{weasel}}
- The results (see below) show behavior similar to the peacock detection task: the model shows good precision and low recall (lower for templates other than peacock). This suggests that there is information about promotional tone that can be detected by the model, and depending on the setup the model could favor precision or recall.
- Coordination:
- We had a meeting with Peter Pelberg, Nicola Ayub, and Megan Neisler to discuss next steps.
- First, we decided that the model needs to be tested against a simple baseline, which can be just a string-matching approach looking for common peacock keywords. I'll be working on this during the next week(s) (note I'll be OoO a few days during the next two weeks).
- Peter is going to decide whether we want to go deeper on this specific task and analyze other factors related to turning this model into a product (serving time, UX, etc.), or work on other tasks that involve ML and user experiences.
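A minimal sketch of what such a string-matching baseline could look like, with the same precision/recall readout used for the fine-tuned model. The keyword list here is illustrative, not the actual one:

```python
# Hypothetical keyword list; the real baseline would use keywords
# derived from the TF-IDF analysis.
PEACOCK_KEYWORDS = {"legendary", "world-class", "renowned", "visionary", "award-winning"}

def flag_peacock(text):
    """Flag a text if it contains any peacock keyword."""
    words = text.lower().split()
    return any(word.strip(".,") in PEACOCK_KEYWORDS for word in words)

def precision_recall(texts, labels):
    """Compare baseline predictions against gold labels (True = peacock)."""
    preds = [flag_peacock(t) for t in texts]
    tp = sum(p and y for p, y in zip(preds, labels))
    fp = sum(p and not y for p, y in zip(preds, labels))
    fn = sum((not p) and y for p, y in zip(preds, labels))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall
```

Like the model, a keyword baseline tends toward high precision and low recall: a hit is strong evidence of promotional tone, but peacock phrasing the list misses goes undetected.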
Sep 4 2024
@achou, just for my curiosity, is the "predict time" the total end-to-end time, or is total = preprocess + predict?
Aug 30 2024
Progress update
Aug 23 2024
Progress update
This looks great @jsn.sherman. Do you know if there is an overlap in the revisions that return an error for each model?
I'm just wondering if the ML model fails on very long diffs (given that it needs to process the text itself).
Aug 19 2024
Aug 14 2024
Aug 8 2024
@Samwalton9-WMF , just keep in mind that the scores from RRML and RRLA are different. This means that you may need to run new user tests to (re)define the thresholds.
Hi @Samwalton9-WMF , we chose RRLA because it was more stable, but since then we have made some updates to RRML (it was not only about serving time, but also errors for some revisions) aimed at making it more stable.
So, if there is interest in switching to RRML (for the 47 languages with coverage), my recommendation would be to run some stress tests on that service, and measure the % of errors and whether Automoderator can tolerate them.
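As a sketch, such a stress test could collect response status codes over a sample of revisions and check the error rate against whatever tolerance Automoderator has. The 5% threshold and the commented-out request loop are illustrative assumptions, not measured values or the real API call:

```python
def error_rate(status_codes):
    """Fraction of failed requests (any non-200 response)."""
    failures = sum(1 for code in status_codes if code != 200)
    return failures / len(status_codes) if status_codes else 0.0

def tolerable(status_codes, threshold=0.05):
    """True if the error rate stays below the (hypothetical) tolerance."""
    return error_rate(status_codes) < threshold

# In a real run, status_codes would be collected from the service, e.g.:
# for rev_id in sample_revisions:
#     resp = requests.post(RRML_URL, json={"rev_id": rev_id, "lang": "en"})
#     status_codes.append(resp.status_code)
```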
Aug 2 2024
Progress update
Jul 26 2024
Progress update
- I've been coordinating with the ML team to share code examples that cause their (experimental) infrastructure to fail. They will use this code as part of their use-case studies when testing new LLM infrastructure.
- In the meantime I've been working on writing code to fine-tune smaller language models. This requires:
- Data preprocessing and cleaning (done)
- Experimental design (done)
- Run experiments on stats machine (in progress)
- Met with the KR owner (Peter Pelberg) and explained the progress and next steps for this hypothesis.
Jul 23 2024
Hi! Apparently the data is missing again:
Jul 22 2024
Jul 19 2024
- Studied how to create prompts for Gemma2. Noticed the importance of using the special tokens and format.
- Designed a zero-shot experiment for detecting peacock behavior.
- Wrote code for testing the Gemma2 instance hosted by the ML team.
- The instance took more than 5 seconds per query.
- After a few requests (around 200) the instance stopped responding.
- I've reported this issue to the ML team; my understanding is they will be working on fixing it during the next week (cc: Chris Albon)
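For reference, Gemma 2 instruction-tuned models expect each conversation turn to be wrapped in special tokens (`<start_of_turn>`/`<end_of_turn>`), which is the formatting detail noted above. A small prompt-building helper might look like this (the instruction text is just an example):

```python
def build_gemma2_prompt(instruction, text):
    """Wrap a zero-shot instruction and input text in Gemma 2's turn tokens.

    The prompt ends with an open model turn so the model continues from there.
    """
    return (
        "<start_of_turn>user\n"
        f"{instruction}\n\n{text}<end_of_turn>\n"
        "<start_of_turn>model\n"
    )

prompt = build_gemma2_prompt(
    "Does the following text contain peacock (promotional) language? Answer yes or no.",
    "He is a legendary, world-class visionary.",
)
```

In practice, using the tokenizer's chat template (e.g. `tokenizer.apply_chat_template`) achieves the same formatting without hand-building the string.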
Jul 18 2024
You are right @leila we should merge them.
Jul 12 2024
Based on our previous research, we have created a dataset containing 9276 articles affected by peacock and other related policy violations on English Wikipedia. For each of them we have negative (no policy violations) and positive examples:
- autobiography: 1472
- fanpov: 350
- peacock: 2587
- weasel: 805
- advert: 4062
- Total: 9276
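The per-template counts above can be sanity-checked against the reported total:

```python
# Counts as listed in the update; the per-template figures sum to the total.
counts = {
    "autobiography": 1472,
    "fanpov": 350,
    "peacock": 2587,
    "weasel": 805,
    "advert": 4062,
}
total = sum(counts.values())
assert total == 9276  # matches the reported dataset size
```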
Jul 5 2024
I'm resolving the task and tracking the model's deployment in T369371
Jul 4 2024
- covid-data.wmf-research-tools.eqiad1.wikimedia.cloud (this one is shut off, so maybe it just needs to be deleted?)
- wikipediaWikidata.wmf-research-tools.eqiad1.wikimedia.cloud
I've just removed these two
Thanks for this work @isarantopoulos!
Jun 28 2024
@Trokhymovych, please post here the models' performance results
To keep this task updated, models for Wikipedia are ready and can be found here:
@Trokhymovych has addressed the comments and submitted the merge request. The model binary can be found here.
I'm going to coordinate with research engineers to decide next steps.
Jun 25 2024
Just for the record, we have migrated the fact-checking API to another instance and deleted the old one.
Jun 24 2024
Jun 18 2024
Thanks @JAllemandou !
May 20 2024
@XiaoXiao-WMF can you please provide more context?
May 6 2024
@lbowmaker the proposed solution sounds OK to me. I have two questions:
May 3 2024
@lbowmaker if I understand correctly, there is no alternative for obtaining historical data for Wikidata edits? If that's the case, we can't keep the Wikidata Revert Risk model updated.
May 2 2024
Apr 29 2024
This task has been resolved, please follow the model deployment here: T363718
Apr 17 2024
This was solved. More details here: T341820