cdn-new.jobhouse.jp
robots.txt

Robots Exclusion Standard data for cdn-new.jobhouse.jp

Resource Scan

Scan Details

Site Domain cdn-new.jobhouse.jp
Base Domain jobhouse.jp
Scan Status Ok
Last Scan2024-11-14T04:17:18+00:00
Next Scan 2024-12-14T04:17:18+00:00

Last Scan

Scanned2024-11-14T04:17:18+00:00
URL https://cdn-new.jobhouse.jp/robots.txt
Domain IPs 108.156.133.103, 108.156.133.123, 108.156.133.36, 108.156.133.9, 2600:9000:2755:600:15:c267:8680:93a1, 2600:9000:2755:6a00:15:c267:8680:93a1, 2600:9000:2755:7a00:15:c267:8680:93a1, 2600:9000:2755:9a00:15:c267:8680:93a1, 2600:9000:2755:ae00:15:c267:8680:93a1, 2600:9000:2755:b000:15:c267:8680:93a1, 2600:9000:2755:d800:15:c267:8680:93a1, 2600:9000:2755:f200:15:c267:8680:93a1
Response IP 108.156.133.36
Found Yes
Hash 8df603be8beb731c394dec219eb4e824ec026d28bbe7dc203a64e22836a3f1f6
SimHash b21d0d0ddf50

Groups

*

Rule Path
Disallow /*?*
Disallow /factory/entry/*

Other Records

Field Value
sitemap https://jobhouse.jp/sitemap
sitemap https://jobhouse.jp/driver/sitemap_search
sitemap https://jobhouse.jp/driver/sitemap_articles
sitemap https://jobhouse.jp/driver/sitemap_posts
sitemap https://jobhouse.jp/driver/sitemap_others
sitemap https://jobhouse.jp/factory/sitemap_search
sitemap https://jobhouse.jp/factory/sitemap_articles
sitemap https://jobhouse.jp/factory/sitemap_posts
sitemap https://jobhouse.jp/factory/sitemap_others

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: