Page MenuHomePhabricator

diego (Diego S-T)
Senior Research Scientist

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Friday

  • Clear sailing ahead.

User Details

User Since
Aug 8 2017, 10:56 AM (380 w, 1 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Diego (WMF) [ Global Accounts ]

Recent Activity

Fri, Nov 8

diego added a subtask for T371865: Who are moderators?: T360794: Implement stream of HTML content on mw.page_change event.
Fri, Nov 8, 5:58 PM · OKR-Work, Research, Epic
diego removed a subtask for T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator: T360794: Implement stream of HTML content on mw.page_change event.
Fri, Nov 8, 5:58 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego edited parent tasks for T360794: Implement stream of HTML content on mw.page_change event, added: T371865: Who are moderators?; removed: T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.
Fri, Nov 8, 5:58 PM · Data-Engineering, Event-Platform
diego added a subtask for T371865: Who are moderators?: T351225: Productionized Edit Types.
Fri, Nov 8, 5:57 PM · OKR-Work, Research, Epic
diego removed a subtask for T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator: T351225: Productionized Edit Types.
Fri, Nov 8, 5:57 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego edited parent tasks for T351225: Productionized Edit Types, added: T371865: Who are moderators?; removed: T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.
Fri, Nov 8, 5:57 PM · Research (FY2024-25-Research-January-March), Event-Platform, Data-Engineering, Research-engineering
diego added a project to T378617: Update mwedittypes to handle HTML diffs: Research-engineering.
Fri, Nov 8, 5:53 PM · Research-engineering, Research (FY2024-25-Research-October-December), OKR-Work
diego added a comment to T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.

Progress update on the hypothesis for the week

Fri, Nov 8, 5:53 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego added a comment to T377324: [SDS 1.2.3] Quantitative lead to support the definition of moderators.

Weekly report:

  • @Pablo continued review of extensions to determine if there were any major sources of log data that we were missing. He identified four major ones with confirmation from SW to make sure are included
Fri, Nov 8, 4:50 PM · Research (FY2024-25-Research-October-December), OKR-Work

Thu, Nov 7

diego added a comment to T379223: Evaluate the impact of temporary accounts on automoderator.

I can confirm that this is the expected behavior.

Thu, Nov 7, 1:56 PM · Temporary accounts, Trust and Safety Product Team, Research, Automoderator, Moderator-Tools-Team
diego set Due Date to Sun, Nov 24, 11:00 PM on T378761: HTML diff dataset for SDS 1.2.3.
Thu, Nov 7, 8:26 AM · Research-engineering, Research

Wed, Nov 6

diego changed Due Date from Thu, Nov 21, 11:00 PM to Sun, Nov 24, 11:00 PM on T378617: Update mwedittypes to handle HTML diffs.
Wed, Nov 6, 4:58 PM · Research-engineering, Research (FY2024-25-Research-October-December), OKR-Work
diego updated subscribers of T378617: Update mwedittypes to handle HTML diffs.

@XiaoXiao-WMF this task is high priority for SDS 1.2.3, please let me know how to proceed.

Wed, Nov 6, 4:57 PM · Research-engineering, Research (FY2024-25-Research-October-December), OKR-Work
diego set Due Date to Thu, Nov 21, 11:00 PM on T378617: Update mwedittypes to handle HTML diffs.
Wed, Nov 6, 4:57 PM · Research-engineering, Research (FY2024-25-Research-October-December), OKR-Work
diego triaged T378617: Update mwedittypes to handle HTML diffs as High priority.
Wed, Nov 6, 4:55 PM · Research-engineering, Research (FY2024-25-Research-October-December), OKR-Work

Tue, Nov 5

diego updated subscribers of T343938: [SPIKE] How might the Editing Team leverage the "revert risk" model to identify high value checks?.

Noting that T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context is done, so it's possible to check the contents of an edit before it's saved. The questions about how to interpret the score and what thresholds would be concerning are the focus of a hypothesis in Q2, WE4.2.11a that @Kgraessle is working on.

@Samwalton9 and probably @KCVelaga_WMF can comment on this.

Tue, Nov 5, 7:00 PM · Product-Analytics, Editing-team (Tracking), EditCheck, VisualEditor
diego changed the status of T378761: HTML diff dataset for SDS 1.2.3, a subtask of T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator, from Open to In Progress.
Tue, Nov 5, 6:12 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego changed the status of T378761: HTML diff dataset for SDS 1.2.3 from Open to In Progress.
Tue, Nov 5, 6:12 PM · Research-engineering, Research
diego added a comment to T378761: HTML diff dataset for SDS 1.2.3.

My understanding is that we need to work on this two things in parallel. The first one is to be able to not stop the work SDS 1.2.3 and the second one is to fully accomplish the goal of this project.
@fkaelin can provide more details.

Tue, Nov 5, 5:55 PM · Research-engineering, Research

Fri, Nov 1

diego reopened Unknown Object (Task), a subtask of T335799: Review papers and give feedback, as Open.
Fri, Nov 1, 4:56 PM · Epic, Research-outreach, Research
diego added a parent task for T378761: HTML diff dataset for SDS 1.2.3: T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.
Fri, Nov 1, 4:54 PM · Research-engineering, Research
diego added a subtask for T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator: T378761: HTML diff dataset for SDS 1.2.3.
Fri, Nov 1, 4:54 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego added a comment to T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.

Progress update on the hypothesis for the week

Fri, Nov 1, 4:53 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego added a parent task for T360794: Implement stream of HTML content on mw.page_change event: T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.
Fri, Nov 1, 4:40 PM · Data-Engineering, Event-Platform
diego added a subtask for T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator: T360794: Implement stream of HTML content on mw.page_change event.
Fri, Nov 1, 4:40 PM · Research (FY2024-25-Research-October-December), OKR-Work

Oct 18 2024

diego added a comment to T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.

Weekly update:

Oct 18 2024, 4:16 PM · Research (FY2024-25-Research-October-December), OKR-Work

Oct 16 2024

diego added a comment to T371902: Request to host the Reference Need Model on LiftWing.

Thanks @achou , and also to @Aitolkyn and @MunizaA , you all did amazing work on making this model faster! The speedup is really impressive and you used cutting edge methods for making this possible. This improvement makes a huge difference from the final user perspective, and specially for the WME use case.

Oct 16 2024, 5:06 PM · Lift-Wing, Machine-Learning-Team

Oct 14 2024

diego removed a subtask for T370134: SDS 1.2.1: Define and prioritize existing use-cases for AI integration into products: T377157: Support SDS 1.2.1 B.
Oct 14 2024, 7:38 PM · OKR-Work, Research
diego added a subtask for T377159: [SDS 1.2.1 B] Test existing AI models for internal use-cases: T377157: Support SDS 1.2.1 B.
Oct 14 2024, 7:38 PM · Research (FY2024-25-Research-October-December)
diego edited parent tasks for T377157: Support SDS 1.2.1 B, added: T377159: [SDS 1.2.1 B] Test existing AI models for internal use-cases; removed: T370134: SDS 1.2.1: Define and prioritize existing use-cases for AI integration into products.
Oct 14 2024, 7:38 PM · Research
diego added a comment to T377157: Support SDS 1.2.1 B.

@Miriam pls confirm or update the parent task.

Oct 14 2024, 4:15 PM · Research
diego added a subtask for T370134: SDS 1.2.1: Define and prioritize existing use-cases for AI integration into products: T377157: Support SDS 1.2.1 B.
Oct 14 2024, 4:15 PM · OKR-Work, Research
diego added a parent task for T377157: Support SDS 1.2.1 B: T370134: SDS 1.2.1: Define and prioritize existing use-cases for AI integration into products.
Oct 14 2024, 4:15 PM · Research
diego created T377157: Support SDS 1.2.1 B.
Oct 14 2024, 4:12 PM · Research
diego added a comment to T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.

Hi @leila , thanks for your words.
I'm optimistic about the project ending on time, as you said we have a great team. I just highlighted the time constrains to explain why we are focusing on offline data for this quarter. However, depending on how these definitions are going to be used in the future, it would be interesting to think how they can work with live data (like real time monitoring), but for now, this is out of the scope for this quarter.

Oct 14 2024, 3:58 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego changed the status of Unknown Object (Task), a subtask of T335799: Review papers and give feedback, from Open to Stalled.
Oct 14 2024, 11:21 AM · Epic, Research-outreach, Research

Oct 11 2024

diego updated the task description for T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.
Oct 11 2024, 5:06 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego added a comment to T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.

Progress update on the hypothesis for the week

  • We have define the list of participants in this project. Apart from myself this includes two 3 people from research (Isaac, Pablo and Yu-Ming), 2 from design research (Claudia and Eli), one from Moderation Tools team (Sam), one Product Analytics (KC) and one from Product Design (Olga T.)
  • Given the size of the team, we decided to split the work in two branches, a qualitative piece lead by Claudia, and a quantitative lead by Isaac.
  • Together with the KR owner (Leila) the hypothesis was defined as: If we combine existing knowledge about  moderators with quantitative methods for detecting moderation activity, we can systematically define and identify Wikipedia moderators.
Oct 11 2024, 4:56 PM · Research (FY2024-25-Research-October-December), OKR-Work

Oct 9 2024

diego added a parent task for T351225: Productionized Edit Types: T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator.
Oct 9 2024, 2:13 PM · Research (FY2024-25-Research-January-March), Event-Platform, Data-Engineering, Research-engineering
diego added a subtask for T376684: [SDS 1.2.3] Develop a working definition for moderation activity and moderator: T351225: Productionized Edit Types.
Oct 9 2024, 2:13 PM · Research (FY2024-25-Research-October-December), OKR-Work

Oct 7 2024

diego added a subtask for T335799: Review papers and give feedback: Unknown Object (Task).
Oct 7 2024, 11:21 AM · Epic, Research-outreach, Research

Oct 4 2024

diego added a comment to T371158: [SPIKE] What percentage of edits are reverted because of peacock behavior?.

Hi @MNeisler ! I've just shared on Slack the list and code to create the peacock related words. Just for the records a put the links to notebook and list here.

Oct 4 2024, 2:15 PM · Editing-team (Kanban Board), Product-Analytics (Kanban), EditCheck

Oct 2 2024

diego closed T368274: [WE1.2.4] Detecting Peacock behavior with LLMs as Resolved.
Oct 2 2024, 5:54 PM · Research (FY2024-25-Research-July-September)
diego updated the task description for T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.
Oct 2 2024, 5:53 PM · Research (FY2024-25-Research-July-September)
diego closed T368274: [WE1.2.4] Detecting Peacock behavior with LLMs, a subtask of T365301: Peacock Check: Prompt people to revise promotional language, as Resolved.
Oct 2 2024, 5:53 PM · EditCheck, Editing-team, VisualEditor
diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Confirm if the hypothesis was supported or contradicted

Oct 2 2024, 5:32 PM · Research (FY2024-25-Research-July-September)

Sep 13 2024

diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Progress update

  • I’m working on building a set of keywords related to peacock behavior and promotional tone. To do this, I’m using a TF-IDF approach, a well-known method to identify terms (keywords) that characterize a set of documents.
  • This and next week are short for me (taking several days off), so it might take a bit more time to finalize this.
  • I also communicated with my manager that there might be the possibility of trying to build a product based on the fine-tune model. In case we decide to move forward, we would need to coordinate with her and other teams involved how to proceed.
Sep 13 2024, 11:59 PM · Research (FY2024-25-Research-July-September)
diego added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

I see an opportunity to use this endpoint for evaluating new pages. Any thoughts @achou or @diego ?

Sep 13 2024, 2:08 PM · Temporary accounts, Machine-Learning-Team

Sep 7 2024

diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Progress update

  • Experiments:
    • As planned I studied the ability of the model fine tuned to detect peacock behavior to detect other promotion-related content issues, described in this data set.
    • I run the model on 4 other datasets: {{fanpov}}, {{advert}}, {{autobiography}}, {{weasel}}
    • The results show (see below) a similar behavior with the peacock detection task. The model shows a good precision and low recall (lower for templates different from peacock). This suggest that there is information about promotional tone that can be detect by the model, and depending on the setup the model could focus on precision or recall
  • Coordination:
    • We have a meeting with Peter Pelberg, Nicola Ayub , and Megan Neisler to discuss next steps.
    • First, we decided that the model needs to be tested again a simple baseline, that can be just a string matching approach, looking for common peacock keywords. I’ll be working on this during the next week(s) (notice I’ll be OoO few days during the next two weeks)
    • Peter is going to decide if we want to go deeper on this specific task, and analysis how other factors related to transform this model into a product (serving time, ux, etc) or work on other tasks that involves ML and user experiences
Sep 7 2024, 9:28 AM · Research (FY2024-25-Research-July-September)

Sep 4 2024

diego added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

@achou, just for my curiosity, is the "predict time" the total end-to-end period or total = preprocess + predict?

Sep 4 2024, 10:08 AM · Temporary accounts, Machine-Learning-Team

Aug 30 2024

diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Progress update

Aug 30 2024, 10:48 PM · Research (FY2024-25-Research-July-September)

Aug 23 2024

diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Progress update

Aug 23 2024, 8:48 PM · Research (FY2024-25-Research-July-September)
diego added a comment to T372298: [SPIKE]Perform a load test for Multilingual Revert Risk on LiftWing[4H].

This looks great @jsn.sherman. Do you know if there is an overlap on the revisions that returns an error for each model?
I'm just wondering if the ML fails on very long diffs (given that needs to process the text itself).

Aug 23 2024, 8:45 PM · Moderator-Tools-Team (Kanban), Machine-Learning-Team, Automoderator

Aug 19 2024

diego updated subscribers of T372747: Repeat Automoderator testing process with Multilingual Revert Risk data.

I would also like to tag @Pablo @diego: do we have regular snapshots of revert risk scores based on the multilingual model as well, or even a single snapshot for a few months?

I don't know, maybe @fkaelin knows.

Aug 19 2024, 12:11 PM · Moderator-Tools-Team, Product-Analytics (Kanban), Automoderator

Aug 14 2024

diego added a subtask for T368791: SDS 1.2.2 Causes behind human administration recruiting, retention, or departure patterns: T372479: RS Support for SDS 1.2.2.
Aug 14 2024, 3:50 PM · Research (FY2024-25-Research-October-December), OKR-Work
diego added a parent task for T372479: RS Support for SDS 1.2.2: T368791: SDS 1.2.2 Causes behind human administration recruiting, retention, or departure patterns.
Aug 14 2024, 3:50 PM · Research
diego updated the task description for T372479: RS Support for SDS 1.2.2.
Aug 14 2024, 3:49 PM · Research
diego created T372479: RS Support for SDS 1.2.2.
Aug 14 2024, 3:48 PM · Research

Aug 8 2024

diego added a comment to T365581: Use multilingual revert risk model in Automoderator on supported wikis.

@Samwalton9-WMF , just keep in mind that the scores from RRML and RRLA are different. This means that you maybe need to run new users' test to (re)define the thresholds.

Aug 8 2024, 3:06 PM · Machine-Learning-Team, Automoderator, Moderator-Tools-Team
diego added a comment to T365581: Use multilingual revert risk model in Automoderator on supported wikis.

Hi @Samwalton9-WMF , we choose RRLA because it was more stable, but since then, we made some updates to RRML (it was not only about serving time, but getting errors for some revisions), that aimed to make it more stable.
So, if there is interest to switch to RRML (for 47 languages with coverage), my recommendation would be to run some stress test on that service, and measure the % of errors and if Automoderator can tolerate them.

Aug 8 2024, 3:00 PM · Machine-Learning-Team, Automoderator, Moderator-Tools-Team

Aug 2 2024

diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Progress update

Aug 2 2024, 6:48 PM · Research (FY2024-25-Research-July-September)

Jul 26 2024

diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Progress update

  • I've been coordinating with ML-team to show code examples that make their (experimental) infrastructure to fail. They will be using this code as part of their use-case studies when testing new LLMs infrastructure.
  • In the meantime I've been working on writing code to fine-tune smaller Language Models, this requires:
    • Data preprocessing and cleaning (done)
    • Experimental design (done)
    • Run experiments on stats machine (in progress)
  • Met with KR owner (Peter Pelberg) and explain the progress and next steps for this hypothesis.
Jul 26 2024, 3:08 PM · Research (FY2024-25-Research-July-September)

Jul 23 2024

diego reopened T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history as "Open".
Jul 23 2024, 9:44 AM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)
diego added a comment to T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history.

Hi! Apparently the data has missing again:

Jul 23 2024, 9:43 AM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)

Jul 22 2024

diego updated subscribers of T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.
Jul 22 2024, 4:02 PM · Research (FY2024-25-Research-July-September)

Jul 19 2024

diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.
  • Studied how to create prompts for Gemma2. Noticed the importance of using special tokens and format.
  • Designed zero-shot experiment for detecting Peacock behavior.
  • Wrote code for testing the Gemma2 instance hosted by the ML-team.
    • The instance took more than 5 seconds per query.
    • After few requests (around 200) the instance stop responding.
    • O've reported this issue to ML-Team, my understanding is they will be working on fixing this during the next week (cc: Chris Albon)
Jul 19 2024, 7:42 PM · Research (FY2024-25-Research-July-September)
diego merged T363718: Deploy Wikidata Revert Risk to LiftWing into T369371: [Research Engineering Request] Deploy the new Wikidata Revert Risk Model.
Jul 19 2024, 10:57 AM · Research-Freezer, Wikidata.org, Wikidata, Research-engineering
diego merged task T363718: Deploy Wikidata Revert Risk to LiftWing into T369371: [Research Engineering Request] Deploy the new Wikidata Revert Risk Model.
Jul 19 2024, 10:55 AM · Research

Jul 18 2024

diego updated subscribers of T369371: [Research Engineering Request] Deploy the new Wikidata Revert Risk Model.

You are right @leila we should merge them.

Jul 18 2024, 3:37 PM · Research-Freezer, Wikidata.org, Wikidata, Research-engineering

Jul 12 2024

diego updated the task description for T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.
Jul 12 2024, 4:09 PM · Research (FY2024-25-Research-July-September)
diego renamed T368274: [WE1.2.4] Detecting Peacock behavior with LLMs from [W.E.1.2.4] Detecting Peacook behavior with LLMs to [W.E.1.2.4] Detecting Peacock behavior with LLMs.
Jul 12 2024, 3:53 PM · Research (FY2024-25-Research-July-September)
diego added a comment to T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.

Based on our previous research, we have created a dataset containing 9276 articles affected by peacock and other related policy violations on English Wikipedia. For each of them we have negative (no policy violations) and positive examples: * Autobiography: 1472

  • fanpov: 350
  • peacock 2587
  • weasel 805
  • advert: 4062
  • Total: 9276
Jul 12 2024, 3:52 PM · Research (FY2024-25-Research-July-September)

Jul 5 2024

diego closed T328813: Develop a ML-based service to detect vandalism on Wikidata as Resolved.
Jul 5 2024, 3:54 PM · Research, Wikidata data quality and trust, Wikidata
diego closed T328813: Develop a ML-based service to detect vandalism on Wikidata, a subtask of T333892: Develop a new generation of ML models for Wikidata, as Resolved.
Jul 5 2024, 3:52 PM · Research-Freezer, Epic, Wikidata data quality and trust, Wikidata, address-knowledge-gaps, Knowledge-Integrity
diego added a comment to T328813: Develop a ML-based service to detect vandalism on Wikidata.

I'm resolving the task and track model's deployment in T369371

Jul 5 2024, 3:52 PM · Research, Wikidata data quality and trust, Wikidata
diego added a subtask for T328813: Develop a ML-based service to detect vandalism on Wikidata: T369371: [Research Engineering Request] Deploy the new Wikidata Revert Risk Model.
Jul 5 2024, 3:52 PM · Research, Wikidata data quality and trust, Wikidata
diego added a parent task for T369371: [Research Engineering Request] Deploy the new Wikidata Revert Risk Model: T328813: Develop a ML-based service to detect vandalism on Wikidata.
Jul 5 2024, 3:52 PM · Research-Freezer, Wikidata.org, Wikidata, Research-engineering
diego created T369371: [Research Engineering Request] Deploy the new Wikidata Revert Risk Model.
Jul 5 2024, 3:48 PM · Research-Freezer, Wikidata.org, Wikidata, Research-engineering

Jul 4 2024

diego added a comment to T367444: Replace or remove Debian Buster VMs in 'wmf-research-tools' cloud-vps project.

@diego:

I've just removed these two

Jul 4 2024, 6:22 PM · cloud-services-team, Cloud-VPS (Debian Buster Deprecation), Research
diego updated the task description for T357033: Reference model Research and Development work.
Jul 4 2024, 6:10 PM · Research (FY2024-25-Research-July-September), Wikimedia Enterprise
diego added a comment to T369055: Investigate deployment of gemma2 on LiftWing.

Thanks for this work @isarantopoulos!

Jul 4 2024, 2:50 PM · Lift-Wing, Machine-Learning-Team

Jun 28 2024

diego added a comment to T328813: Develop a ML-based service to detect vandalism on Wikidata.

@Trokhymovych, please post here the models' performance results

Jun 28 2024, 8:10 AM · Research, Wikidata data quality and trust, Wikidata
diego added a comment to T314384: Develop a ML-based service to predict reverts on Wikipedia(s).

To keep this task updated, models for Wikipedia are ready and can be found here:

Jun 28 2024, 8:07 AM · Machine-Learning-Team, Research, Epic
diego added a comment to T328813: Develop a ML-based service to detect vandalism on Wikidata.

@Trokhymovych has addresed the comments and submitted the merge request. Model binary can be found here.
I'm going to coordinate with research engineers to decide next steps.

Jun 28 2024, 8:02 AM · Research, Wikidata data quality and trust, Wikidata
diego updated subscribers of T328813: Develop a ML-based service to detect vandalism on Wikidata.
Jun 28 2024, 7:58 AM · Research, Wikidata data quality and trust, Wikidata
diego updated subscribers of T328813: Develop a ML-based service to detect vandalism on Wikidata.
Jun 28 2024, 7:57 AM · Research, Wikidata data quality and trust, Wikidata

Jun 25 2024

diego added a comment to T367551: Cloud VPS "research-collaborations-api" project Buster deprecation.

Just for the records, we have migrated the fact-checking API to another instance and deleted the old one.

Jun 25 2024, 9:36 AM · Research, Cloud-VPS (Debian Buster Deprecation)

Jun 24 2024

diego created T368274: [WE1.2.4] Detecting Peacock behavior with LLMs.
Jun 24 2024, 2:24 PM · Research (FY2024-25-Research-July-September)

Jun 18 2024

diego closed T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history as Resolved.
Jun 18 2024, 2:23 PM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)
diego added a comment to T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history.

Thanks @JAllemandou !

Jun 18 2024, 2:23 PM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)

May 20 2024

diego added a comment to T365360: adopt production ready code structure.

@XiaoXiao-WMF can you please provide more context?

May 20 2024, 1:10 PM · Research

May 6 2024

diego added a comment to T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history.

@lbowmaker the proposed solution sounds ok to me. I have two questions around:

May 6 2024, 3:58 PM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)

May 3 2024

diego updated subscribers of T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history.
May 3 2024, 1:51 PM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)
diego added a comment to T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history.

@lbowmaker if understand correctly, there is no alternative for obtaining historical data for Wikidata edits? If this is the case, we can't keep the Wikidata Revert Risk model updated

May 3 2024, 1:50 PM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)

May 2 2024

diego created T364045: [Bug?] Can't find wikidatawiki on wmf.mediawiki_wikitext_history.
May 2 2024, 9:02 PM · Wikidata, Wikidata Analytics, Data-Engineering (Q4 2024 April 1st - June 30th)

Apr 29 2024

diego closed T341820: Evaluate and improve the Revert Risk model for Wikidata. as Resolved.
Apr 29 2024, 3:32 PM · Research (FY2023-24-Research-April-June)
diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

This task has been resolved, please follow the model deployment here: T363718

Apr 29 2024, 3:31 PM · Research (FY2023-24-Research-April-June)
diego closed T341820: Evaluate and improve the Revert Risk model for Wikidata., a subtask of T328813: Develop a ML-based service to detect vandalism on Wikidata, as Resolved.
Apr 29 2024, 3:30 PM · Research, Wikidata data quality and trust, Wikidata
diego created T363718: Deploy Wikidata Revert Risk to LiftWing.
Apr 29 2024, 3:29 PM · Research

Apr 17 2024

diego added a comment to T343064: Expand types of edits for Wikidata revert risk model.

This was solved. More details here T341820

Apr 17 2024, 4:30 PM · Research