muuseo.com
robots.txt

Robots Exclusion Standard data for muuseo.com

Resource Scan

Scan Details

Site Domain muuseo.com
Base Domain muuseo.com
Scan Status Ok
Last Scan 2024-09-26T03:27:11+00:00
Next Scan 2024-10-03T03:27:11+00:00

Last Scan

Scanned 2024-09-26T03:27:11+00:00
URL https://muuseo.com/robots.txt
Domain IPs 13.114.166.56, 54.238.105.44
Response IP 13.114.166.56
Found Yes
Hash c8e3c454963511d8df705357215932b604b56e7ac2bc38f50aed9c9eff59d39a
SimHash c21c08c5e644

Groups

googlebot

Rule Path
Disallow /diaries/new

bingbot

Rule Path
Disallow /diaries/new

Other Records

Field Value
crawl-delay 30

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /diaries/new

germcrawler

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /diaries/new

linespider

Rule Path
Disallow /diaries/new

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /admin/
Disallow /users/auth/

Other Records

Field Value
sitemap https://s3.ap-northeast-1.amazonaws.com/muuseo-jp/sitemaps/sitemap.xml.gz
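
As a rough illustration of how the groups above might be evaluated by a crawler, the following Python sketch transcribes the listed rules into a dictionary and applies longest-prefix matching, the precedence described in RFC 9309. The data layout and the names GROUPS, CRAWL_DELAY, rules_for and is_allowed are assumptions made for this example and are not part of the scan data. The sketch ignores the `*` and `$` wildcards (none appear in these rules) and expects the agent argument to be a bare product token such as "googlebot", not a full User-Agent header.

```python
# Rules transcribed from the groups listed above.
# The dictionary layout is an illustrative assumption, not the site's file format.
DIARIES_ONLY = [("disallow", "/diaries/new")]
BLOCK_ALL = [("disallow", "/")]

GROUPS = {
    "googlebot": DIARIES_ONLY,
    "bingbot": DIARIES_ONLY,
    "ahrefsbot": BLOCK_ALL,
    "amazonbot": BLOCK_ALL,
    "dataforseobot": DIARIES_ONLY,
    "germcrawler": BLOCK_ALL,
    "icc-crawler": DIARIES_ONLY,
    "linespider": DIARIES_ONLY,
    "petalbot": BLOCK_ALL,
    "semrushbot": BLOCK_ALL,
    "*": [
        ("allow", "/"),
        ("disallow", "/admin/"),
        ("disallow", "/users/auth/"),
    ],
}

# Only the bingbot group carries a crawl-delay record (in seconds).
CRAWL_DELAY = {"bingbot": 30}


def rules_for(agent):
    """Return the rules of the group named after the agent, else the '*' group."""
    return GROUPS.get(agent.lower(), GROUPS["*"])


def is_allowed(agent, path):
    """Longest matching prefix wins; on equal length, allow beats disallow."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, prefix in rules_for(agent):
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        if n > best_len or (n == best_len and kind == "allow"):
            best_len, allowed = n, (kind == "allow")
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("googlebot", "/diaries/new"),
        ("ahrefsbot", "/"),
        ("somebot", "/admin/users"),
        ("somebot", "/diaries/new"),
    ]:
        print(agent, path, is_allowed(agent, path))
    print("bingbot crawl-delay:", CRAWL_DELAY.get("bingbot"))
```

Under this reading, a crawler that has its own group follows only that group, so the /admin/ and /users/auth/ rules of the `*` group would apply only to agents not named above. Parsers that use first-match ordering instead of longest-match may treat the leading "Allow /" in the `*` group differently.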

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file