myhirehop.com
robots.txt

Robots Exclusion Standard data for myhirehop.com

Resource Scan

Scan Details

Site Domain myhirehop.com
Base Domain myhirehop.com
Scan Status Ok
Last Scan2024-09-26T09:16:13+00:00
Next Scan 2024-10-26T09:16:13+00:00

Last Scan

Scanned2024-09-26T09:16:13+00:00
URL https://myhirehop.com/robots.txt
Domain IPs 13.33.30.111, 13.33.30.38, 13.33.30.55, 13.33.30.86
Response IP 13.33.30.55
Found Yes
Hash 3734bb18ddc86e0e0a3e84a0a319744deb0fa08f6051576f50a0c9f12f590e2c
SimHash 38903b1b25c9

Groups

mj12bot

Rule Path
Disallow /

accelobot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

bingbot

Rule Path
Disallow /modules/*
Disallow /php_functions/
Disallow /plugins/
Disallow /reports/
Disallow /home.php
Disallow /job.php
Disallow /pos.php
Disallow /project.php

Other Records

Field Value
crawl-delay 10

googlebot

Rule Path
Disallow /modules/*
Disallow /php_functions/
Disallow /plugins/
Disallow /reports/
Disallow /home.php
Disallow /job.php
Disallow /pos.php
Disallow /project.php

Other Records

Field Value
sitemap http://myhirehop.com/sitemap.xml

Comments

  • BEGIN FILE
  • allow-all
  • The use of robots or other automated means to access the HireHop site
  • without the express permission of HireHop is strictly prohibited.
  • Notwithstanding the foregoing, HireHop may permit automated access to
  • access certain HireHop pages but soley for the limited purpose of
  • including content in publicly available search engines. Any other
  • use of robots or failure to obey the robots exclusion standards set
  • forth at <http://www.robotstxt.org/ wc/ exclusion.html> is strictly
  • prohibited.
  • v3
  • sitemaps
  • END FILE