archive.sudomemo.net
robots.txt

Robots Exclusion Standard data for archive.sudomemo.net

Resource Scan

Scan Details

Site Domain archive.sudomemo.net
Base Domain sudomemo.net
Scan Status Ok
Last Scan 2025-09-29T13:45:13+00:00
Next Scan 2025-10-29T13:45:13+00:00

Last Scan

Scanned 2025-09-29T13:45:13+00:00
URL https://archive.sudomemo.net/robots.txt
Domain IPs 151.101.130.217, 151.101.194.217, 151.101.2.217, 151.101.66.217
Response IP 199.232.46.217
Found Yes
Hash e7e2d5dc83943ed65b37de2dde21761a8c73f5f38c028887feccdc650b7d07eb
SimHash 601cc2a3eab1

Groups

*

Rule Path
Allow /
Disallow /watch/embed/*

claudebot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /theatre_assets/statistics_mainpage.php

yandex

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /theatre_assets/images/dynamic/*

mj12bot

Rule Path
Disallow /*

ahrefsbot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /watch/*
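
The groups above can be checked programmatically. The following is a minimal sketch, not the scanner's own method: it feeds a hand-copied subset of the listed groups to Python's standard `urllib.robotparser` and asks whether a given agent may fetch a path. The names `ROBOTS_LINES`, `BASE`, `build_parser`, `is_allowed`, and the agent `SomeBot` are assumptions made for illustration. Groups whose paths contain `*` are left out on purpose, because the standard-library parser matches rule paths as plain prefixes and is not expected to interpret wildcards.

```python
from urllib.robotparser import RobotFileParser

# Hand-copied subset of the groups listed in this report (not the live file).
# Wildcard rules such as /watch/embed/* are omitted: the stdlib parser
# compares rule paths as plain prefixes.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /",
    "",
    "User-agent: claudebot",
    "Disallow: /",
    "",
    "User-agent: ia_archiver",
    "Disallow: /theatre_assets/statistics_mainpage.php",
]

# Site origin taken from the scan details.
BASE = "https://archive.sudomemo.net"


def build_parser(lines=ROBOTS_LINES):
    """Return a RobotFileParser loaded from in-memory lines (no network access)."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, agent, path):
    """Return True if `agent` may fetch `path` on BASE under the parsed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    parser = build_parser()
    checks = [
        ("claudebot", "/"),
        ("ia_archiver", "/theatre_assets/statistics_mainpage.php"),
        ("ia_archiver", "/"),
        ("SomeBot", "/"),  # hypothetical agent, falls through to the * group
    ]
    for agent, path in checks:
        print(agent, path, is_allowed(parser, agent, path))
```

Parsing from a list of lines keeps the check offline and repeatable. To test against the live file instead, `RobotFileParser.set_url()` followed by `read()` could be used, at the cost of depending on the network and on whatever the file contains at that moment.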

Comments

  • Sorry Yandex but you didn't play nice

Warnings

  • 1 invalid line.