theworks.jobs
robots.txt

Robots Exclusion Standard data for theworks.jobs

Resource Scan

Scan Details

Site Domain theworks.jobs
Base Domain theworks.jobs
Scan Status Ok
Last Scan2024-11-03T10:32:04+00:00
Next Scan 2024-11-17T10:32:04+00:00

Last Scan

Scanned2024-11-03T10:32:04+00:00
URL https://theworks.jobs/robots.txt
Redirect https://www.theworks.jobs/robots.txt
Redirect Domain www.theworks.jobs
Redirect Base theworks.jobs
Domain IPs 104.21.44.81, 172.67.197.149, 2606:4700:3030::6815:2c51, 2606:4700:3037::ac43:c595
Redirect IPs 207.120.40.5, 207.120.40.8, 207.120.43.13, 207.120.43.2, 207.120.43.3, 207.120.43.4, 207.120.43.6, 207.120.43.9
Response IP 207.120.40.5
Found Yes
Hash b298b2c4ec02d39b87c486249a689ac74d6c2dd5f10e1da5d6d5979c1a5338bd
SimHash 82521b8de673

Groups

aihitdata

Rule Path
Disallow /

*

Rule Path
Disallow /app/
Disallow /messages/
Disallow /messenger/
Disallow /facebook/tab/
Disallow /jobs/internal/

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file