Pup 2 Joined PRIVILEGED
About

I'm currently developing a tagbot and running it every now and then. Here's what it's currently tagging:

Tagbot stuff:
Currently tagging:
Ratios:
Ratio Reason
1:1 Icons/Avatars
2:1 VR Resolution
2:3 Common Phone ratio
3:1 Twitter header image
3:2 Common Desktop ratio (Chromebook pixel, Microsoft Surface)
4:1 Twitter header image
4:3 Common Desktop ratio (especially on older monitors)
5:3 Common Phone ratio
5:4 Common Desktop ratio (especially on older monitors)
5:6 Included on https://e621.net/wiki_pages/27693
7:4 Included on https://e621.net/wiki_pages/27693
14:9 A compromise ratio, meant to give a decent image on both 4:3 and 16:9 TVs/monitors
16:9 Common Desktop ratio
16:10 Common Desktop ratio
17:9 Uncommon Desktop ratio
18:9 Uncommon Phone ratio
18.5:9 Uncommon Phone ratio
19.5:9 Uncommon Phone ratio
21:9 Uncommon Desktop ratio (UltraWide)
32:9 Uncommon Desktop ratio (Samsung)
256:135 Uncommon Desktop ratio (the Digital Cinema Initiatives 4K standard is 4096×2160)

(Thanks to Idem for the original table/list that I added one or two extra ones to.)
Each ratio is also tagged in reverse (for example, 9:16 as well as 16:9), as monitors can often be rotated.
Mis-tagged ratio tags from 1:1 to 30:30 are removed as well.
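
A minimal sketch of how the ratio tagging and mis-tag removal described above might work. This is not the bot's actual code: the function names, the shortened RATIO_TAGS list and the exact-match rule are all assumptions made for illustration.

```python
import re
from fractions import Fraction

# Hypothetical subset of the ratio table above; the real list is longer.
RATIO_TAGS = ["1:1", "2:1", "4:3", "16:9", "16:10", "18:9", "18.5:9"]

# Matches tags shaped like "16:9" or "18.5:9".
RATIO_RE = re.compile(r"^(\d{1,2}(?:\.\d+)?):(\d{1,2}(?:\.\d+)?)$")


def parse_ratio(tag):
    """Turn a tag such as '18.5:9' into an exact fraction."""
    a, b = tag.split(":")
    return Fraction(a) / Fraction(b)


def with_reversed(tags):
    """Return the tags plus their reversed forms (16:9 -> 9:16), without duplicates."""
    out = []
    for tag in tags:
        a, b = tag.split(":")
        for candidate in (tag, f"{b}:{a}"):
            if candidate not in out:
                out.append(candidate)
    return out


def ratio_tags_for(width, height, tags=RATIO_TAGS):
    """List every known ratio tag that exactly matches the image dimensions."""
    if width <= 0 or height <= 0:
        return []
    actual = Fraction(width, height)
    return [t for t in with_reversed(tags) if parse_ratio(t) == actual]


def mistagged_ratios(post_tags, width, height):
    """List ratio tags between 1:1 and 30:30 that do not match the image."""
    if width <= 0 or height <= 0:
        return []
    actual = Fraction(width, height)
    wrong = []
    for tag in post_tags:
        m = RATIO_RE.match(tag)
        if not m:
            continue
        a, b = Fraction(m.group(1)), Fraction(m.group(2))
        if 1 <= a <= 30 and 1 <= b <= 30 and a / b != actual:
            wrong.append(tag)
    return wrong
```

Using Fraction keeps the comparison exact and copes with non-integer tags such as 18.5:9. Note that equivalent tags (2:1 and 18:9) both match the same image under this rule.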

Thumbnail/low_res/hi_res/absurd_res/superabsurd_res/4k

huge_filesize

animated
animated_png
animated_comic
2_frame_animation
short_playtime
long_playtime
sound/no_sound/sound_warning (Only removes sound_warning)
high_framerate

flash

alpha_channel
*_and_white
*_and_black
monochrome (added to the *_and_colour tags)

cool_colors (Being put in a private set to test)
warm_colors (Being put in a private set to test)

black_bars
letterbox
border
*_border

qr_code

md5_mismatch
missing_sample
bad_metadata - MD5 Mismatch
bad_metadata - File sizes that are zero or negative
bad_metadata - Resolutions that are negative
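
The three bad_metadata checks above could be sketched roughly like this. The function name, its parameters and the returned labels are hypothetical; only the checks themselves come from the list.

```python
import hashlib


def metadata_problems(file_bytes, listed_md5, listed_size, listed_width, listed_height):
    """Compare a downloaded file against the metadata listed for its post.

    Returns a list of hypothetical problem labels; an empty list means
    none of the three checks fired.
    """
    problems = []
    # MD5 of the actual file must equal the MD5 stored on the post.
    if hashlib.md5(file_bytes).hexdigest() != listed_md5.lower():
        problems.append("md5_mismatch")
    # A file size of zero or less cannot be right.
    if listed_size <= 0:
        problems.append("bad_file_size")
    # Neither dimension may be negative.
    if listed_width < 0 or listed_height < 0:
        problems.append("bad_resolution")
    return problems
```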

Hashing visual data, to check for identical posts with different metadata.
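
One possible way to do this, as a sketch only: hash the decoded pixels rather than the file, so two files with identical image data but different metadata produce the same digest. The decoding step is assumed to happen elsewhere (for example with an imaging library), and the names visual_hash and group_identical are made up for this example.

```python
import hashlib


def visual_hash(width, height, mode, pixels):
    """Hash decoded image data; file-level metadata never enters the digest.

    `pixels` is assumed to be the raw decoded pixel bytes and `mode` a
    string such as "RGB" describing their layout.
    """
    h = hashlib.sha256()
    # Include the dimensions and mode so that, e.g., a 2x8 and a 4x4 image
    # with the same byte stream do not collide.
    h.update(f"{width}x{height}:{mode}:".encode("ascii"))
    h.update(pixels)
    return h.hexdigest()


def group_identical(posts):
    """Group post IDs whose decoded image data is identical.

    `posts` maps a post ID to a (width, height, mode, pixels) tuple.
    Only groups with more than one post are returned.
    """
    groups = {}
    for post_id, (w, h, mode, pixels) in posts.items():
        groups.setdefault(visual_hash(w, h, mode, pixels), []).append(post_id)
    return [ids for ids in groups.values() if len(ids) > 1]
```

This only catches pixel-for-pixel identical images; re-encoded or resized copies would need a perceptual hash instead.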

Swapping http:// to https:// in sources and descriptions for these urls:

4chan.org
i.4cdn.org
img.4cdn.org

artstation.com

aryion.com

behance.net
mir-s3-cdn-cf.behance.net

blogspot.com
(blog).blogspot.com

danbooru.donmai.us

derpibooru.org

deviantart.com
pre00.deviantart.net
api-da.wixmp.com
images-wixmp-ed30a86b8c4ca887773594c2.wixmp.com

discord.gg
discordapp.com
cdn.discordapp.com
media.discordapp.net

dropbox.com
dl.dropboxusercontent.com

e-hentai.org

facebook.com
web.facebook.com

fanart-central.net
pictures.fanart-central.net

flickr.com
live.staticflickr.com

furaffinity.net
d.facdn.net
t.facdn.net
a.facdn.net

furiffic.com
cdn.furiffic.com

furrylife.online
cdn.furrylife.online

furrynetwork.com

gelbooru.com

gfycat.com

hentai-foundry.com

imgur.com
i.imgur.com

inkbunny.net
metapix.net
(aa-zz/0-99).ib.metapix.net

instagram.com

fanart.lionking.org

newgrounds.com
art.ngfiles.com
uploads.ungrounded.net

blog.newtumbl.com
dn0.newtumbl.com

paheal.net
cache.paheal.net
rule34.paheal.net
rule34c.paheal.net
rule34-beta.paheal.net
rule34-images.paheal.net
rule34c-images.paheal.net
rule34-data-(000-999).paheal.net
*.paheal.net/_images

patreon.com

pixiv.net
pximg.net
i.pximg.net

puu.sh

reddit.com

rule34.xxx

shimmie.shishnet.org

sofurry.com
sofurryfiles.com
(artist).sofurry.com

steamcommunity.com
steamuserimages-a.akamaihd.net

transfur.com

trello.com

twitter.com
twimg.com
pbs.twimg.com

tumblr.com
media.tumblr.com
data.tumblr.com
(blog).tumblr.com
a.tumblr.com
s3.amazonaws.com/data.tumblr.com
(00-99).media.tumblr.com

u18chan.com

vk.com
pp.userapi.com

cdn.weasyl.com

wilddream.net

youtube.com
youtu.be
img.youtube.com
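
As an illustration of the swap, here is a short sketch that upgrades http:// to https:// only for hosts on an allowlist. The host set is a small sample of the list above, and the www. handling, the suffix matching for entries like (blog).tumblr.com and the function name are assumptions.

```python
import re

# Small sample of the hosts listed above.
HTTPS_HOSTS = {"imgur.com", "i.imgur.com", "furaffinity.net", "d.facdn.net", "pbs.twimg.com"}
# Entries such as (blog).tumblr.com are treated as "any subdomain of".
WILDCARD_SUFFIXES = (".tumblr.com", ".blogspot.com")

# Captures the host part of a plain-http URL.
URL_RE = re.compile(r"http://([A-Za-z0-9.-]+)", re.IGNORECASE)


def upgrade_urls(text):
    """Rewrite http:// to https:// for allowlisted hosts, leaving others alone."""
    def repl(match):
        # Normalise for comparison only: lower-case, and drop a trailing dot
        # picked up when a bare URL ends a sentence.
        host = match.group(1).lower().rstrip(".")
        bare = host[4:] if host.startswith("www.") else host
        if bare in HTTPS_HOSTS or bare.endswith(WILDCARD_SUFFIXES):
            return "https://" + match.group(1)
        return match.group(0)

    return URL_RE.sub(repl, text)
```

Matching on the host rather than doing a blanket string replace avoids breaking links to sites that have no HTTPS version.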

OCR - Text Recognition
Other tagging scripts:

Removing sex from these posts, as bedroom_eyes and sex are mutually exclusive and sex is rarely mistagged.
bedroom_eyes sex -solo -comic -animated -multiple_images -sketch_page -sequence
(Asked for by Versperus)
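
A rough sketch of the script's logic, applied to a post's tags locally. The matches and cleaned_tags helpers are hypothetical and only handle plain and negated (-tag) search terms, which is all this query uses.

```python
QUERY = "bedroom_eyes sex -solo -comic -animated -multiple_images -sketch_page -sequence"


def matches(query, post_tags):
    """Return True if the post has every plain term and none of the -negated ones."""
    for term in query.split():
        if term.startswith("-"):
            if term[1:] in post_tags:
                return False
        elif term not in post_tags:
            return False
    return True


def cleaned_tags(post_tags):
    """Drop the sex tag from posts that match the query; leave others unchanged."""
    if matches(QUERY, post_tags):
        return post_tags - {"sex"}
    return post_tags
```

For example, a post tagged {"bedroom_eyes", "sex", "duo"} loses sex, while one that is also tagged comic is left alone.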

Tags I'm working on implementing:

##########
Tags that probably won't be added for a while:
##########

greyscale

barcode

Adding links to artist's donation/paysites that have URLs listed in their Artist tag.

patreon - (scanning/checking for the logo, probably with OpenCV)

##########
Tags that have either been removed or not implemented to avoid mistagging:
##########

OCR - Text Recognition

(colour)_theme - shouldn't include single colour backgrounds, but it's hard to differentiate the background from the foreground programmatically.

Removing unknown_artist/unknown_artist_signature where an artist is tagged. - Could wrongly remove the tag on an art collaboration where one of the artists is unknown.

pillarbox - because it's a pain to tag. A lot were 1px pillarboxed with a faint difference in colour, making it hard to tell if it was correctly tagged or not.

Removing invalid_tag/invalid_color/invalid_background/invalid_character where they're on posts. - Forum topic #29363

Adding sources to posts:

These were one-off things, but worth mentioning.

Added 28k Derpibooru sources to posts where they were missing.
(Thanks to Byte[] for a list of MD5s of Derpibooru's posts.)

Added 57k FurryNetwork sources to posts where they were missing.
(Thanks to Idem for a list of FN pages, direct links and E6 IDs.)

Some privacy browser addons:

These are Firefox addons, but a few also have versions for Chrome:
uBlock Origin - Ad blocker
HTTPS Everywhere - Checks websites for encrypted versions
Privacy Badger - Blocks invisible trackers and/or cookies
Decentraleyes - Serves local copies of jQuery and other common libraries, so the sites hosting them can't track you
Disable WebRTC - Stops your browser leaking your IP if you're behind a VPN or proxy

If you only use Google for their search engine, I'd recommend https://www.startpage.com/ which acts as a proxy to Google (similar to how DuckDuckGo uses Bing's search results), letting you search more privately.
