oslist.ca
robots.txt
Robots Exclusion Standard data for oslist.ca
Resource Scan
Scan Details
Site Domain | oslist.ca |
Base Domain | oslist.ca |
Scan Status | Ok |
Last Scan | 2024-09-17T10:27:11+00:00 |
Next Scan | 2024-09-24T10:27:11+00:00 |
Last Scan
Scanned | 2024-09-17T10:27:11+00:00 |
URL | https://oslist.ca/robots.txt |
Redirect | https://www.oslist.ca/robots.txt |
Redirect Domain | www.oslist.ca |
Redirect Base | oslist.ca |
Domain IPs | 104.21.82.24, 172.67.151.92, 2606:4700:3033::ac43:975c, 2606:4700:3035::6815:5218 |
Redirect IPs | 104.21.82.24, 172.67.151.92, 2606:4700:3033::ac43:975c, 2606:4700:3035::6815:5218 |
Response IP | 104.21.82.24 |
Found | Yes |
Hash | ffbc42a8d68e74da1037fe48100132075f164475c9b37bbe20b0b22b32de50e4 |
SimHash | 87099b61cbbf |
Groups
*
Rule | Path |
---|---|
Allow | / |
Allow | /year |
Allow | /year/* |
Allow | /sec |
Allow | /sec/* |
Allow | /org |
Allow | /org/*/* |
Allow | /org/job/*/*/* |
Allow | /emp |
Allow | /emp/* |
Allow | /job |
Allow | /job/* |
Allow | /about |
Allow | /faqs |
Allow | /terms |
Disallow | /contact |
Disallow | /emp/json/* |
Disallow | /emp/json/*/* |
Disallow | /emp/json/*/*/* |
Disallow | /org/json/* |
Disallow | /org/json/*/* |
Disallow | /org/json/*/*/* |
Disallow | /org/json/*/*/*/* |
Disallow | /job/json/* |
Disallow | /job/json/*/* |
Disallow | /job/json/*/*/* |
Disallow | /job/json/*/*/*/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.oslist.ca/allseo-sitemap.xml |