arine.jp
robots.txt

Robots Exclusion Standard data for arine.jp

Resource Scan

Scan Details

Site Domain arine.jp
Base Domain arine.jp
Scan Status Ok
Last Scan2024-11-01T14:59:23+00:00
Next Scan 2024-11-08T14:59:23+00:00

Last Scan

Scanned2024-11-01T14:59:23+00:00
URL https://arine.jp/robots.txt
Domain IPs 108.157.254.116, 108.157.254.42, 108.157.254.63, 108.157.254.94
Response IP 108.157.254.116
Found Yes
Hash d37ff366fc234d5c066c3352ab8a12d15efdd44b7429eb1185df1593f3d9cb09
SimHash b28d0d856550

Groups

*

Rule Path
Disallow /search*
Disallow /api/v1/pageviews

screaming frog seo spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://s3-ap-northeast-1.amazonaws.com/gree-luccy-assets/sitemaps/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /