web.gekisaka.jp
robots.txt

Robots Exclusion Standard data for web.gekisaka.jp

Resource Scan

Scan Details

Site Domain web.gekisaka.jp
Base Domain gekisaka.jp
Scan Status Ok
Last Scan2024-11-06T18:18:27+00:00
Next Scan 2024-12-06T18:18:27+00:00

Last Scan

Scanned2024-11-06T18:18:27+00:00
URL https://web.gekisaka.jp/robots.txt
Domain IPs 108.156.133.25, 108.156.133.6, 108.156.133.61, 108.156.133.99
Response IP 108.156.133.99
Found Yes
Hash 0737c5a9b9343243a45ac6ab40ae331610d7991d18eb2a83c4e4ac81bc86ca46
SimHash 1955e0724fab

Groups

*

Rule Path
Disallow /search*

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

shortlinktranslate

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

kraken

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

adlessebot

Rule Path
Disallow /

maxpointcrawler

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

ndl-japan

Rule Path
Disallow /