inssagram.com
robots.txt

Robots Exclusion Standard data for inssagram.com

Resource Scan

Scan Details

Site Domain inssagram.com
Base Domain inssagram.com
Scan Status Ok
Last Scan2025-05-11T06:06:54+00:00
Next Scan 2025-06-10T06:06:54+00:00

Last Scan

Scanned2025-05-11T06:06:54+00:00
URL http://inssagram.com/robots.txt
Domain IPs 133.186.228.238
Response IP 133.186.228.238
Found Yes
Hash c7aff78fdeb79da98ff56205f130f2a2234b7e80797594f73272ee455b6d1894
SimHash a05410034782

Groups

*

Rule Path
Allow /
Allow /robots.txt
Disallow /admin/
Disallow /config/
Disallow /module/
Disallow /tmp/

mj12bot
semrushbot
claudebot
gptbot

Rule Path
Disallow /

facebookexternalhit
dotbot
screaming frog seo spide
heritrix
bingbot
owasp
geofeed
dirbuster-1.0-rc1
pfinapp
droid build
googlebot
cowbot
yeti
ads-naver
blueno
daumoa

Rule Path
Disallow /admin/
Disallow /config/
Disallow /data/
Disallow /module/
Disallow /tmp/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.inssagram.com/sitemap.xml

Comments

  • Default Bot Policy List provided by GODOMALL