workventure.com
robots.txt

Robots Exclusion Standard data for workventure.com

Resource Scan

Scan Details

Site Domain workventure.com
Base Domain workventure.com
Scan Status Ok
Last Scan2024-11-11T19:05:36+00:00
Next Scan 2024-11-18T19:05:36+00:00

Last Scan

Scanned2024-11-11T19:05:36+00:00
URL https://workventure.com/robots.txt
Redirect https://www.workventure.com/robots.txt
Redirect Domain www.workventure.com
Redirect Base workventure.com
Domain IPs 104.21.8.219, 172.67.140.72, 2606:4700:3035::6815:8db, 2606:4700:3035::ac43:8c48
Redirect IPs 104.21.8.219, 172.67.140.72, 2606:4700:3035::6815:8db, 2606:4700:3035::ac43:8c48
Response IP 172.67.140.72
Found Yes
Hash bd18a17f735346b58cfb4cf9e6c554826afef8e3846328cda4aa4889d22d3123
SimHash 296cccc66715

Groups

*

Rule Path
Allow /
Disallow /term
Disallow /password/reset
Disallow /index.php/
Disallow /index.php
Disallow /student/signup
Disallow /en/student/signup
Disallow /index.html
Disallow /th*/

googlebot

Rule Path
Disallow /jooble.xml
Disallow /recruit.xml
Disallow /recruitpromo.xml
Disallow /indeed.xml
Disallow /indeedpromo.xml
Disallow /jobs77.xml
Disallow /incruit.xml
Disallow /student/signup
Disallow /en/student/signup
Disallow /en/opt-out
Disallow /opt-out
Disallow /careerjet.xml