Wikimedia Technology/Goals/2019-20 Q3

Q2 Wikimedia Technology Goals, FY2019–20, Q3 (January - March 2020) Q4

Technology Department Team Goals and Status for Q3 FY19/20 in support of the Medium Term Plan (MTP) Priorities and Annual Plan for FY19/20


Team Manager: Nuria Ruiz Team Manager: Nuria Ruiz

Modern Event Platform
Build a reliable, scalable, and comprehensive platform for creating services, tools and user facing features that produce and consume event data
Deploy a new Datastream for analytics in EventGate task T238138
Errors from browser clients are surfaced to developers task T226986
Smart Tools for Better Data. Make easier to understand the history of all Wikimedia projects
stats.wikimedia.org point to Wikistats2 by default task T237752


Smart Tools for Better Data. Increase Data Quality, Privacy and Security
Bots: Label high volume bot spikes in pageview data as automated traffic task T238357
Core. Operational Excellence. Increase Resilience of Systems
Core. Operational Excellence. Reduce Operational Load by Phasing Out Legacy Systems/Technologies

Status


Team Manager: Corey Floyd

Core Work
Enable MultiDC Reads
IP Masking
IP Masking Implementation
Platform Evolution / Modern Event Platform
Initial modularization of MediaWiki (planning) (continued from Q2)
Push notification service
Initial modularization of MediaWiki (one component)
FAWG (desktop refresh work)
Wikimedia Unified API
Tech & Product Partnerships
Developer Portal Implementation
Integrate OAuth 2.0 into API (Phase 2)
Paid API project

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Team Manager: Gilles Dubuc

Core Work
Figure out the right store to use for the main stash (continued from Q1)
Publish 8 blog posts about performance (continued from Q1)
Support and maintenance of MediaWiki's object caching and data access components (continued from Q1)
Support and maintenance of WebPageTest and synthetic testing infrastructure (continued from Q1)
Support and maintenance of MediaWiki's ResourceLoader (continued from Q1)
Support and maintenance of Fresnel (continued from Q1)
Add operational monitoring for 100% of the performance-team services
Have at least 2 years of retention for ArcLamp flame graphs
Organise and run the Web Performance devroom at FOSDEM 2020 (continued from Q2)
Make seen/unseen state of watched pages reliable (all affected users who reported the issue are satisfied)
DC-shared object caches are available and replicate even if 20% of the servers fail
DC-shared temporary data is written via queues or replicated stores (continued from Q2)
Shared caching and temporary data storage use established/documented patterns
A memory-only lightweight store for temporary data exists and supports global (cross-wiki) keys (continued from Q2)
A disk-backed lightweight store for temporary data exists and supports global (cross-wiki) keys (continued from Q2)
Document how to add your own User Timing and how to see it in RUM and synthetic testing
Collect and graph First Input Delay
Document how to add your own Element Timing and how to see it in synthetic testing
Document how to add your own user journey for synthetic testing
Document the search user journeys in synthetic testing
Platform Evolution / Modern Event Platform
Provide performance expertise to FAWG outcome (continued from Q1)

Dependencies on:


Team Manager: JR Branaa

Core Work
Team inception, formalization, and assessment of current organizational practices (continued from Q1)
Add all deployed to production repos to the Code Health pipeline (Code Health Metrics).
Solicit feedback from current users of CHM POC and define phase 2 enhancements (continued from Q2)
Interview engineering teams to understand their current code review practices (continued from Q2)
Relaunch the Code Review Office Hours (continued from Q2)
Put in place Code Review performance metrics (continued from Q2)
Develop Test Strategy for CPT
Platform Evolution / Modern Event Platform
Make CI warn about slow tests, and publish a collated list of slow tests
Transfer maintainership/ownership of API Test Tooling from CPT

Dependencies on:


Team Manager: Tyler Cipriani

Core Work
Set up an experimental elastic search instance to store and analyze CI logs and metrics
Continuation of Phabricator and Gerrit improvement (in conjunction with SRE) (continued from Q2)
Migrate from Gerrit version 2.15 to 2.16
Platform Evolution / Modern Event Platform
Other service deployment pipeline migrations as prioritized between SRE/RelEng and relevant teams (continued from Q2)
A demonstration MediaWiki development environment hosts the full TimedMediaHandler front-end and back-end workflow

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Team Manager: Erika Bjune

Core Work
Support Advancement in testing and planned Q3 campaigns
Make IDEAL payment processor campaign (support for Q4 campaigns)
Start Matching Gifts V2

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Team Manager: Aaron Halfaker

Content Integrity
JBuild/improve models in response to community demand (ongoing every quarter)
Machine Learning Infrastructure
Jade expansion/Iteration
Session-model use, maintenance, and user-research

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Team Manager: Leila Zia

Address Knowledge Gaps
Conduct a literature review, plan and set up collaborations for projects about understanding engagement with Wikimedia images around the world. (continued from Q2)
Build one formal collaborations in the disinformation space to start the research for building solutions starting Q3. (continued from Q2)

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Team Manager: Guillaume Lederrey

Core Work
1.1 New query parser is used in production by the end of Q3
2.2 WDQS storage expansion (continued from Q2)
7.1. Increase understanding of our work outside our team, and outside the Foundation
8.1. Improve search quality, especially for non-English wikis by prioritizing community requests - Positive feedback from speakers/community on changes made
10.1 Newcomer task
Wikidata
Improve WDQS updater performance
Machine Learning Infrastructure
3.1. Glent method 1 (comparison to other users' queries) offline tested, tuned, A/B tested and possibly deployed end of Q3 (continued from Q2)
Address Knowledge Gaps
6.1. Increase of training data retention (>90 days) is validated with Legal / Privacy (continued from Q2)
6.2. Any new data retention requirements are implemented (validate with Legal) (continued from Q2)
Structured Data
9.1. Proof of Concept SPARQL endpoint for SDoC is available on WMCS and updated weekly. (stretch) (continued from Q2)

Dependencies on: SDC, Legal

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Team Manager: John Bennett

Core Work
Incident response Table Top and updates to security after action reports and improvement plans (continued from Q2)
Create design document for DAST implementation and development tools pen testing
Threat Intel/Hunt
NIST Assessments
Create or improve language-based best security practices documentation (continued from Q2)

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Directors: Mark Bergsma and Faidon Liambotis

Cross-cutting

Service Operations

Team Manager: Mark Bergsma

Core Work

Data Persistence

Team Manager: Mark Bergsma

Core Work

Traffic

Team Manager: Brandon Black

Core Work

Infrastructure Foundations

Team Manager: Faidon Liambotis

Core Work

Observability

Team Manager: Faidon Liambotis

Core Work

Data Center Operations

Team Manager: Willy Pao

Core Work
Modify existing dc-ops processes to be able to measure SLAs effectively
Create landing page, that directs end users to the various types of data center requests, its appropriate template, and expected turnaround time
Partner with Joel and Automation team to establish reports that can measure SLAs via Phabricator and/or Netbox
Define SLAs for each type of dc-ops tasks
Order and receive all Q3 hardware procurement orders by end of quarter
Clean out eqiad storage room and send all decommissioned equipment and unneeded parts for recycling by end of January
Partner with Julianne and Automation team to revamp decommission process of manually entering information into spreadsheet
Reduce total number of open data center tasks by 30%

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do


Team Manager: Birgit Müller

Developer Advocacy

Team Manager: Birgit Müller

Key Deliverables: Reduce Complexity of the Platform; Movement diversity

  • Create a blog by and for technical audiences where members of the technical community can post about their technical work
  • Publish 6 (min) technical blog posts
  • Create regular cadence of content -- strive for 3 x per week -- @MediaWiki and @Wikimediatech
  • Run Wikimedia Technical Talks -- increase views on talks by 10%
  • Prepare release of 3rd edition of the Tech Community Newsletter (publishing date: April 2020)
  • Make further improvements to the dashboard for Wikimedia Cloud Services edit data and announce it on targeted channels.
  • Publish current numbers on technical contributions provided by Bitergia in the Quarterly Tech Community newsletter (by Jan 2020)
  • Coordinate with Bitergia and get data on "Avg. Time Open (Days)" for Gerrit patchsets per affiliation and "time to first review" data for patches by end of Q4.
  • Find out what is needed to get data on technical contributions/contributors (by Q3)
  • Provide “showroom”, introducing newcomers to a variety of different tools to show what developers can do in Toolforge (by Q3, in collaboration with GCI students)
  • In Q2/Q3, at least 700 task instances are completed in Google Code-in.
  • At least five projects are successfully completed by Outreachy interns by end of Q3.
  • At least 12 projects are promoted in GSOC and Outreachy programs.
  • Kick off Friends of the Docs initiative (prep work in Q3; kick off in Q4)
  • Develop workshop concept with partner community for technical workshops in Q3/Q4
  • Provide continuous support for teams and individuals in Phabricator
  • Conduct at least 4 workshops + introductions into Phabricator at movement events by end of Q4
  • Establish Phabricator training for new staff members
  • A starter kit for small wikis containing a recommended set of templates, Gadgets, bots etc. is available by Q4

Wikimedia Cloud Services

Team Manager: Bryan Davis

tbd

Dependencies on:

Status

  • January 2020 status - To do To do
  • February 2020 status - To do To do
  • March 2020 status - To do To do