muuseo.com
robots.txt

Robots Exclusion Standard data for muuseo.com

Resource Scan

Scan Details

Site Domain muuseo.com
Base Domain muuseo.com
Scan Status Ok
Last Scan2024-11-14T04:40:19+00:00
Next Scan 2024-11-21T04:40:19+00:00

Last Scan

Scanned2024-11-14T04:40:19+00:00
URL https://muuseo.com/robots.txt
Domain IPs 54.150.161.83, 54.238.30.24
Response IP 54.238.30.24
Found Yes
Hash 2bc7223eb55e5986056be18f12f78babfea6fb26e37bd6d2b03b4e0d6d9f6f28
SimHash c21c19c5e644

Groups

googlebot

Rule Path
Disallow /diaries/new

bingbot

Rule Path
Disallow /diaries/new
Disallow /search

Other Records

Field Value
crawl-delay 30

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /diaries/new

germcrawler

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /diaries/new

linespider

Rule Path
Disallow /diaries/new

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /admin/
Disallow /users/auth/

Other Records

Field Value
sitemap https://s3.ap-northeast-1.amazonaws.com/muuseo-jp/sitemaps/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file