Page MenuHomePhabricator

Story Idea for Blog: Automated detection of wikipedia censorship events
Closed, ResolvedPublic

Description

On my category of volunteer would like to publish this blogpost on the technical blog about automated censorship and outage detection, still WIP but probably ready for a 1st review.

Feel free to not like image selected, etc
https://docs.google.com/document/d/1w4VUujPdLt8NObzlPuHHp3MJ1eUfOKEl3OT3gXnPC9Y/edit

I am cc-ing @Slaporte cause this expands on work that we did couple years ago with the Center for Internet & Society at Harvard University and he might want to take a look.

Event Timeline

Nintendofan885 renamed this task from Story Idea for Blog: Automated detection of wikipedia censhorship events to Story Idea for Blog: Automated detection of wikipedia censorship events .Dec 3 2020, 2:33 PM
Nintendofan885 updated the task description. (Show Details)

@Nuria Good to hear from you, and awesome! I look forward to reading this! I'm will look at it early next week for suggestions and edits!!!

@Nuria I suggested a bunch of small changes. Can you please review and accept or decline? Overall, this looks good and is really interesting!

Hey all ~ Just a reminder that I will be out of the office until Jan 4 beginning this beginning Fri Dec 18.

Unless the revisions are finished by Friday morning, it is most likely this will be published in the first weeks of the new year.

@srodlund perfect, that gives me next week to finalize the text. The new year sounds great.

@Nuria Happy New Year! I'm back from vacation, so let me know when you believe the post is ready, and I'll take another pass at it.

@srodlund I think it is almost final! Accepted all your corrections and elaborated a bit on the conclusion. Please take a second look. Let me know if the tables are to be translated into images (or HTML tables) or how do you prefer to do that.

@Nuria sorry! I've been super busy and distracted this week (and missed the notification from Phab).

I checked, and it looks like the tables will copy over just fine, so no need to make any changes to them.

I will take a look at the unresolved comments and move this over to the blog!

Also, do you have an image you want to use for this post? I can pick one if you don't. (It should be a photo and have appropriate permissions for reuse).

@Nuria this is now ready to go. And I can publish it tomorrow (Friday) -- I just need a little more info about the images.

I picked this image but let me know if you would like something different: https://commons.wikimedia.org/wiki/File:Piata_Romana_-_Iarna.jpg

For the Globe with the read Censored banner in front of it, do you have the licensing/rights info?

For the Globe with the read Censored banner in front of it, do you have the licensing/rights info?

That's a draft I made, I downloaded the Wikipedia logo from commons, and added the "censored" label on top with the Gimp.

Okay, I'm going to call it a derivative of this logo (https://commons.wikimedia.org/wiki/File:Wikipedia-logo-v2.svg) and attribute the image with the censored label to you @mforns.

"derivative of logo" sounds good. No rush on publishing it whenever works for you.

@Nuria This is published! https://techblog.wikimedia.org/2021/01/15/censorship-outages-and-internet-shutdowns-monitoring-wikipedias-accessibility-around-the-world/

Can you take a look and let me know if everything looks good to you and if there is anything that needs correction?

Once I have your go ahead, I'll announce it more widely.

@Nuria This is published! https://techblog.wikimedia.org/2021/01/15/censorship-outages-and-internet-shutdowns-monitoring-wikipedias-accessibility-around-the-world/

Can you take a look and let me know if everything looks good to you and if there is anything that needs correction?

Once I have your go ahead, I'll announce it more widely.

Thanks for all your work on this, @srodlund!

Few minor things that we carried over from the document:

1). Towards the end of the page (last paragraph), I notice that there is an extra new line between "Now," and "the system is".

2). For the formula for entropy, can we copy-paste the image from the document? Pasting it as text makes it a bit more difficult to read and comprehend.

3). the "geographical distribution" should be The "geographical distribution".

Thanks, @ssingh! I've addressed these issues. I did have to replace the formula with an actual image, as the formatting was not copying over from the doc, and I wasn't able to format it correctly as text in the blog. Let me know if this looks okay to you!

Thanks, @ssingh! I've addressed these issues. I did have to replace the formula with an actual image, as the formatting was not copying over from the doc, and I wasn't able to format it correctly as text in the blog. Let me know if this looks okay to you!

Thanks, it looks great now!

@srodlund in mobile specially the initial paragraph : "The act of detecting anomalous events in a series of events (in this case a time series of Wikipedia pageviews) is called anomaly detection. The anomalies we are looking for are sudden drops in pageviews on a per-country basis." looks, I think, much too prominent, can we remove entirely so blogpost starts at "About four years ago"

@Nuria Hey hey ~ That is the summary, which I can't suppress. I can put different text in there though if you have a couple of sentences you think would best summarize the article; let me know, and I'll add them.

@srodlund I see, how about (probably a reworked version of)

"This article describes the methodology used by the Wikimedia Foundation to monitor outages on Wikipedia around the world, these events are called anomalies and could be due to various causes, among them censorship."

Great! I have updated the post with this text! Have a good weekend!

Thanks everybody. Especially @Nuria for putting all this together.

cc @Slaporte that blogpost about technical measures to detect censhorship is been published