istgah.com
robots.txt

Robots Exclusion Standard data for istgah.com

Resource Scan

Scan Details

Site Domain istgah.com
Base Domain istgah.com
Scan Status Ok
Last Scan 2024-11-14T06:47:23+00:00
Next Scan 2024-12-14T06:47:23+00:00

Last Scan

Scanned 2024-11-14T06:47:23+00:00
URL https://istgah.com/robots.txt
Redirect https://www.istgah.com/robots.txt
Redirect Domain www.istgah.com
Redirect Base istgah.com
Domain IPs 37.156.144.123
Redirect IPs 37.156.144.123
Response IP 37.156.144.123
Found Yes
Hash 008fc9698c952f7d6fd307722fded99edb6aea1130683001bef61f65d6a4afb7
SimHash 025ccb104792

Groups

*

Rule Path
Allow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

exabot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

covario

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /
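
The groups above can be checked programmatically. The following is a minimal sketch, assuming the listed groups map onto ordinary User-agent / Allow / Disallow lines. The ROBOTS_TXT string is a hypothetical reconstruction of only a few of the groups, not the scanned file itself (which also contains 2 invalid lines not shown in this report). The helper names build_parser and is_allowed, and the agent name "ExampleBot", are illustrative assumptions.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a subset of the groups listed in the report.
# The real file at https://www.istgah.com/robots.txt may differ in wording and order.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: baiduspider
User-agent: baiduspider-video
User-agent: baiduspider-image
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: mj12bot
Disallow: /
"""

SITE_URL = "https://www.istgah.com/"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, url: str = SITE_URL) -> bool:
    """Return True if the given user agent may fetch the URL under the parsed rules."""
    return parser.can_fetch(agent, url)


parser = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    for agent in ("ExampleBot", "AhrefsBot", "MJ12bot", "Baiduspider-image"):
        print(agent, is_allowed(parser, agent))
```

Agents without a dedicated group fall back to the `*` group, which allows everything, while each named crawler is blocked from the whole site by `Disallow: /`.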

Warnings

  • 2 invalid lines.