santora.jp
robots.txt

Robots Exclusion Standard data for santora.jp

Resource Scan

Scan Details

Site Domain santora.jp
Base Domain santora.jp
Scan Status Ok
Last Scan2024-09-20T13:01:07+00:00
Next Scan 2024-09-27T13:01:07+00:00

Last Scan

Scanned2024-09-20T13:01:07+00:00
URL http://santora.jp/robots.txt
Domain IPs 118.243.82.226
Response IP 118.243.82.226
Found Yes
Hash 693cce47c596415ec849052ac0f4dee7478cfca0025925c3223a89cbad9d3394
SimHash 02b6c4c087d9

Groups

yetibot
psbot
ia_archiver
becomebot
becomejpbot
turnitinbot
e-societyrobot
irlbot
mozilla/2.0 (compatible; ask jeeves/teoma)
lc-crawler
discobot
speedy
ccbot
yacy
mlbot
dotbot
scoutjet
gonzo
voyager
archive_crawler
archive.org_bot
steeler
wbsearchbot
metamojicrawler
wotbox
yandex
blexbot
spbot
riddler
siteexplorer
proximic
semrushbot
grapeshot
piplbot
gptbot
amazonbot
ias_crawler
meta-externalagent

Rule Path
Disallow /

*

Rule Path
Disallow /asinimg/
Disallow /asis/
Disallow /comment/
Disallow /itunes/
Disallow /search/

baiduspider
bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

Warnings

  • 2 invalid lines.