profguide.io
robots.txt

Robots Exclusion Standard data for profguide.io

Resource Scan

Scan Details

Site Domain profguide.io
Base Domain profguide.io
Scan Status Ok
Last Scan 2024-11-15T21:44:11+00:00
Next Scan 2024-11-22T21:44:11+00:00

Last Scan

Scanned 2024-11-15T21:44:11+00:00
URL https://profguide.io/robots.txt
Redirect https://www.profguide.io/robots.txt
Redirect Domain www.profguide.io
Redirect Base profguide.io
Domain IPs 185.38.19.15
Redirect IPs 185.38.19.15
Response IP 185.38.19.15
Found Yes
Hash 54bfa42c12b5e41ba6556c5c918124067b31f43ee7b18864a79c03d72dc88148
SimHash 94351d05b817
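
The report does not name the algorithm behind the Hash field. Its 64 hexadecimal digits are consistent with a SHA-256 digest, so the following is only a minimal sketch, under that assumption, of how a scanner might detect that a robots.txt body changed between scans. The function names `content_fingerprint` and `has_changed` are hypothetical, and the SimHash field is not covered here.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of the raw robots.txt bytes.

    Assumption: the report's Hash field is SHA-256; the source does not say so.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return content_fingerprint(body) != previous_hash.lower()
```

An exact digest only reports whether any byte differs. A similarity hash such as the SimHash value above is typically used to judge how much the content differs.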

Groups

*

Rule Path
Disallow /site/*
Disallow /en/site/*
Disallow /my/*
Disallow /guide/*
Disallow /access/*
Disallow /search/*
Disallow /en/search/*
Disallow /test/api/*
Disallow /test/api2/*
Disallow /test/result/*
Disallow /en/test/result/*
Disallow /test/*?p=*
Disallow /item%5B%5D%3D*
Disallow /professions/alpha/
Disallow /professions/search/*
Disallow /professions/objects/search/?id=*
Disallow /proforientation/consultations/proftest/
Disallow /test/proftest-profguide.html
Disallow /proforientation/consultations/test-adult/
Disallow /proforientation/consultations/test-teen/
Disallow /cdn-cgi/
Disallow /*?utm_source=*
Disallow /*?redirect=*
Disallow /*%26redirect%3D*
Disallow /*?sort=*
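
To illustrate how the wildcard rules above apply to a request path, here is a minimal sketch of robots.txt pattern matching: `*` matches any run of characters, a trailing `$` anchors the end, and patterns match from the start of the path. It covers only a handful of the listed rules, does not normalize percent-encoding, and the names `pattern_to_regex` and `is_disallowed` are hypothetical. Because the group shown contains only Disallow rules, any match is treated as a block.

```python
import re

# A subset of the Disallow patterns listed for the `*` group.
DISALLOW = [
    "/site/*",
    "/test/*?p=*",
    "/professions/alpha/",
    "/cdn-cgi/",
    "/*?utm_source=*",
    "/*?sort=*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any pattern matches from the start of `path`."""
    # re.Pattern.match anchors at the beginning of the string only.
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

For example, `is_disallowed("/site/login")` and `is_disallowed("/articles/?utm_source=x")` are both true under this sketch, while `is_disallowed("/professions/")` is false because no listed pattern is a prefix match for it.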

Other Records

Field Value
sitemap https://www.profguide.io/sitemap.xml

Warnings

  • `clean-param` is not a known field.
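
`Clean-param` is a Yandex-specific extension, which is why a parser limited to the standard fields reports it as unknown. A warning of this kind can be sketched as follows. The set of known fields, the function name `unknown_fields`, and the sample file (including the `Clean-param` value) are all assumptions for illustration and are not taken from the scanned file.

```python
# Assumption: the fields a strict parser recognizes; the scanner's real list is not given.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

# Hypothetical sample; the Clean-param value is invented for illustration.
SAMPLE = """\
User-agent: *
Disallow: /cdn-cgi/
Clean-param: utm_source
Sitemap: https://www.profguide.io/sitemap.xml
"""


def unknown_fields(robots_txt: str) -> list[str]:
    """Return unrecognized field names, lowercased, in first-seen order."""
    seen = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```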