
I'm currently developing a tagbot and running it every now and then. Here's what it's currently tagging:
Ratio | Reason
---|---
1:1 | Icons/Avatars
2:1 | VR Resolution
2:3 | Common Phone ratio
3:1 | Twitter header image
3:2 | Common Desktop ratio (Chromebook pixel, Microsoft Surface)
4:1 | Twitter header image
4:3 | Common Desktop ratio (especially on older monitors)
5:3 | Common Phone ratio
5:4 | Common Desktop ratio (especially on older monitors)
5:6 | Included on https://e621.net/wiki_pages/27693
7:4 | Included on https://e621.net/wiki_pages/27693
14:9 | A compromise ratio, meant to give a decent image on both 4:3 and 16:9 TVs/monitors
16:9 | Common Desktop ratio
16:10 | Common Desktop ratio
17:9 | Uncommon Desktop ratio
18:9 | Uncommon Phone ratio
18.5:9 | Uncommon Phone ratio
19.5:9 | Uncommon Phone ratio
21:9 | Uncommon Desktop ratio (UltraWide)
32:9 | Uncommon Desktop ratio (Samsung)
256:135 | Uncommon Desktop ratio (Digital Cinema Initiatives 4K standard is 4096×2160)
(Thanks to Idem for the original table/list that I added one or two extra ones to.)
The reverse of each ratio is tagged as well (so both 16:9 and 9:16), as monitors can often be rotated.
Any ratio tag from 1:1 to 30:30 is removed where it's mis-tagged.
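As a rough illustration of the ratio tagging and clean-up described above, here's a minimal sketch. It assumes an exact match between the image's dimensions and the ratio (the bot itself may well allow some tolerance), and `KNOWN_RATIOS`, `ratio_tag` and `mistagged_ratios` are hypothetical names, not the bot's actual code. Only a handful of the ratios from the table are included.

```python
import re
from fractions import Fraction

# Hypothetical subset of the ratio table; decimal ratios such as
# 18.5:9 are stored as exact fractions (18.5/9 == 37/18).
KNOWN_RATIOS = {
    "1:1": Fraction(1, 1),
    "4:3": Fraction(4, 3),
    "16:9": Fraction(16, 9),
    "16:10": Fraction(16, 10),
    "18.5:9": Fraction(37, 18),
    "21:9": Fraction(21, 9),
}

RATIO_TAG = re.compile(r"^(\d+):(\d+)$")


def ratio_tag(width, height):
    """Return the matching ratio tag (or its reverse), else None."""
    if width <= 0 or height <= 0:
        return None
    actual = Fraction(width, height)
    for name, ratio in KNOWN_RATIOS.items():
        if actual == ratio:
            return name
        if actual == 1 / ratio:
            # Rotated image: swap the two halves of the tag name.
            w, h = name.split(":")
            return f"{h}:{w}"
    return None


def mistagged_ratios(width, height, tags):
    """Return ratio tags between 1:1 and 30:30 that don't fit the image."""
    actual = Fraction(width, height)
    wrong = []
    for tag in tags:
        m = RATIO_TAG.match(tag)
        if not m:
            continue
        w, h = int(m.group(1)), int(m.group(2))
        if 1 <= w <= 30 and 1 <= h <= 30 and Fraction(w, h) != actual:
            wrong.append(tag)
    return wrong
```

Using `Fraction` avoids floating point rounding, so 1920×1080 and 3840×2160 both reduce to exactly 16:9. The downside of an exact match is that near-misses such as 1366×768 get no ratio tag at all.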
thumbnail/low_res/hi_res/absurd_res/superabsurd_res/4k
animated
animated_png
animated_comic
2_frame_animation
short_playtime
long_playtime
sound/no_sound/sound_warning (Only removes sound_warning)
high_framerate
alpha_channel
*_and_white
*_and_black
monochrome (added to the *_and_colour tags)
cool_colors (Being put in a private set to test)
warm_colors (Being put in a private set to test)
black_bars
letterbox
border
*_border
md5_mismatch
missing_sample
bad_metadata - MD5 Mismatch
bad_metadata - File sizes that are zero or negative
bad_metadata - Resolutions that are negative
Hashing visual data, to check for identical posts with different metadata.
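To illustrate the idea of hashing visual data, here's a minimal sketch. It assumes the image has already been decoded to raw pixel bytes by an image library (that step isn't shown), and the choice of SHA-256 plus the names `visual_hash` and `find_visual_duplicates` are my own assumptions rather than what the bot actually does. Two files with different MD5s but the same pixel hash are visually identical posts whose files differ only in metadata.

```python
import hashlib


def file_md5(data: bytes) -> str:
    """MD5 of the file as stored, metadata included."""
    return hashlib.md5(data).hexdigest()


def visual_hash(width: int, height: int, pixels: bytes) -> str:
    """Hash the decoded pixels plus dimensions, ignoring file metadata."""
    h = hashlib.sha256()
    # Include the dimensions so a 2x8 and an 8x2 image with the same
    # byte stream don't collide.
    h.update(f"{width}x{height}:".encode("ascii"))
    h.update(pixels)
    return h.hexdigest()


def find_visual_duplicates(posts):
    """Group posts by visual hash; keep groups whose file MD5s differ.

    `posts` is an iterable of (post_id, file_bytes, width, height, pixels).
    """
    groups = {}
    for post_id, file_bytes, width, height, pixels in posts:
        key = visual_hash(width, height, pixels)
        groups.setdefault(key, []).append((post_id, file_md5(file_bytes)))
    return [
        sorted(pid for pid, _ in members)
        for members in groups.values()
        if len({md5 for _, md5 in members}) > 1
    ]
```

This only catches pixel-for-pixel matches. Re-encoded or resized copies would need a perceptual hash instead, which is a different technique.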
4chan.org
i.4cdn.org
img.4cdn.org
artstation.com
aryion.com
behance.net
mir-s3-cdn-cf.behance.net
blogspot.com
(blog).blogspot.com
danbooru.donmai.us
derpibooru.org
deviantart.com
pre00.deviantart.net
api-da.wixmp.com
images-wixmp-ed30a86b8c4ca887773594c2.wixmp.com
discord.gg
discordapp.com
cdn.discordapp.com
media.discordapp.net
dropbox.com
dl.dropboxusercontent.com
e-hentai.org
facebook.com
web.facebook.com
fanart-central.net
pictures.fanart-central.net
flickr.com
live.staticflickr.com
furaffinity.net
d.facdn.net
t.facdn.net
a.facdn.net
furiffic.com
cdn.furiffic.com
furrylife.online
cdn.furrylife.online
furrynetwork.com
gelbooru.com
gfycat.com
hentai-foundry.com
imgur.com
i.imgur.com
inkbunny.net
metapix.net
(aa-zz/0-99).ib.metapix.net
instagram.com
fanart.lionking.org
newgrounds.com
art.ngfiles.com
uploads.ungrounded.net
blog.newtumbl.com
dn0.newtumbl.com
paheal.net
cache.paheal.net
rule34.paheal.net
rule34c.paheal.net
rule34-beta.paheal.net
rule34-images.paheal.net
rule34c-images.paheal.net
rule34-data-(000-999).paheal.net
*.paheal.net/_images
patreon.com
pixiv.net
pximg.net
i.pximg.net
puu.sh
reddit.com
rule34.xxx
shimmie.shishnet.org
sofurry.com
sofurryfiles.com
(artist).sofurry.com
steamcommunity.com
steamuserimages-a.akamaihd.net
transfur.com
trello.com
twitter.com
twimg.com
pbs.twimg.com
tumblr.com
media.tumblr.com
data.tumblr.com
(blog).tumblr.com
a.tumblr.com
s3.amazonaws.com/data.tumblr.com
(00-99).media.tumblr.com
u18chan.com
vk.com
pp.userapi.com
cdn.weasyl.com
wilddream.net
youtube.com
youtu.be
img.youtube.com
Posts with ancient_art and traditional_media_(artwork) aren't checked.
text
english_text
danish_text
dutch_text
finnish_text
french_text
german_text
italian_text
norwegian_text
portuguese_text
spanish_text
url
grawlixes
profanity
greeting
good_boy
good_girl
sound_effects
holiday_message
easter
christmas
halloween
valentine's_day
new_year
thanksgiving
father's_day
mother's_day
st._patrick's_day
Removing sex from these posts, as bedroom_eyes and sex are mutually exclusive and sex is rarely mistagged.
bedroom_eyes sex -solo -comic -animated -multiple_images -sketch_page -sequence
(Asked for by Versperus)
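For anyone wanting to apply the same filter locally, here's a minimal sketch of matching a post's tags against that search. It only handles plain tags and `-negated` tags, which is all this query uses; `matches` and `QUERY` are hypothetical names, and the site's real search supports more syntax than this.

```python
QUERY = (
    "bedroom_eyes sex -solo -comic -animated "
    "-multiple_images -sketch_page -sequence"
)


def matches(query: str, post_tags: set) -> bool:
    """True if the post has every plain tag and none of the -negated ones."""
    required, excluded = set(), set()
    for term in query.split():
        if term.startswith("-"):
            excluded.add(term[1:])
        else:
            required.add(term)
    return required <= post_tags and not (excluded & post_tags)
```

For example, `matches(QUERY, {"bedroom_eyes", "sex", "duo"})` is true, while adding `comic` to that set makes it false.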
##########
Tags that probably won't be added for a while:
##########
Adding links to artist's donation/paysites that have URLs listed in their Artist tag.
patreon - (scanning/checking for the logo, probably with OpenCV)
##########
Tags that have either been removed or not implemented to avoid mistagging:
##########
apology
artist_name
character_name
dirty_talk
question
threat
wall_of_text
latin_text (removed as it caused false positives)
pet_praise
?!
(colour)_theme - shouldn't include single colour backgrounds, but it's hard to differentiate the background from the foreground programmatically.
Removing unknown_artist/unknown_artist_signature where an artist is tagged. - This could wrongly remove the tag from an art collaboration where one of the artists is unknown.
pillarbox - because it's a pain to tag. A lot were 1px pillarboxed with a faint difference in colour, making it hard to tell if it was correctly tagged or not.
Removing invalid_tag/invalid_color/invalid_background/invalid_character where they're on posts. - Forum topic #29363
These were one-off things, but worth mentioning.
Added 28k Derpibooru sources to posts where they were missing.
(Thanks to Byte[] for a list of MD5s of Derpibooru's posts.)
Added 57k FurryNetwork sources to posts where they were missing.
(Thanks to Idem for a list of FN pages, direct links and E6 IDs.)
These are Firefox addons, but a few also have versions for Chrome:
uBlock Origin - Ad blocker
HTTPS Everywhere - Checks websites for encrypted versions
Privacy Badger - Blocks invisible trackers and/or cookies
Decentraleyes - Saves local copies of jQuery and other libraries, so the sites hosting them can't track you
Disable WebRTC - Stops your browser leaking your IP if you're behind a VPN or proxy.
If you only use Google for their search engine, I'd recommend https://www.startpage.com/ which acts as a proxy to Google (similar to how DuckDuckGo uses Bing's search results), letting you search more privately.