manga-tube.me
robots.txt

Robots Exclusion Standard data for manga-tube.me

Resource Scan

Scan Details

Site Domain manga-tube.me
Base Domain manga-tube.me
Scan Status Ok
Last Scan2024-05-24T13:45:45+00:00
Next Scan 2024-05-31T13:45:45+00:00

Last Scan

Scanned2024-05-24T13:45:45+00:00
URL https://manga-tube.me/robots.txt
Domain IPs 104.21.71.173, 172.67.171.21, 2606:4700:3030::ac43:ab15, 2606:4700:3033::6815:47ad
Response IP 172.67.171.21
Found Yes
Hash 36cb658c595a3e85dd9366c8b38206b6c9ca0a0e8837ad7ad962b5e535b7b4e8
SimHash cb1859f0cf53

Groups

mj12bot
megaindex.ru/2.0
megaindex.ru
megaindex.ru
cloudendure scanner (ops@cloudendure.com)
archive.org_bot
mail.ru_bot
obot
istellabot
easouspider
dbot
zookabot
proximic
crystalsemanticsbot
larbin
blexbot
ubicrawler
unisterbot
doc
zao
zealbot
msiecrawler
fetch
offline explorer
teleport
teleportpro
webzip
linko
microsoft.url.control
xenu
larbin
zyborg
download ninja
grub-client
k2spider
npbot
smartviper
urlmetriken
psbot
gigabot
baiduspider
nutch
cityreview
penthesilea
prlog
libwww
zyborg
sogou web spider
java
add catalog
voilabot
vagabondo
turnitinbot
nerdybot
seznambot
wsr-agent
coccoc
careerbot
cpython
meanpathbot
euripbot
python-requests
ahrefsbot

Rule Path
Disallow /

yandexbot
baiduspider
yandex
exabot

Rule Path
Disallow /

sitesnagger
webstripper
webcopier
webreaper
httrack

Rule Path
Disallow /

emailcollector
webemailextrac
trackback
emailsiphon
emailspider
emailwolf

Rule Path
Disallow /

trident

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /series/search/
Disallow /profile/
Disallow /affliate/

Comments

  • sitemap: https://manga-tube.me/sitemap.xml
  • Boese Bots
  • Ausländische Suchmaschinen
  • Web Scraper
  • Email Scraper
  • undefined crawler
  • Trusted crawler

Warnings

  • 1 invalid line.