careerguide.com
robots.txt
Robots Exclusion Standard data for careerguide.com
Resource Scan
Scan Details
| Site Domain | careerguide.com |
| Base Domain | careerguide.com |
| Scan Status | Ok |
| Last Scan | 2025-12-05T16:57:24+00:00 |
| Next Scan | 2025-12-12T16:57:24+00:00 |
Last Scan
| Scanned | 2025-12-05T16:57:24+00:00 |
| URL | https://careerguide.com/robots.txt |
| Redirect | https://www.careerguide.com/robots.txt |
| Redirect Domain | www.careerguide.com |
| Redirect Base | careerguide.com |
| Domain IPs | 104.21.70.106, 172.67.222.220, 2606:4700:3034::ac43:dedc, 2606:4700:3035::6815:466a |
| Redirect IPs | 104.21.70.106, 172.67.222.220, 2606:4700:3034::ac43:dedc, 2606:4700:3035::6815:466a |
| Response IP | 172.67.222.220 |
| Found | Yes |
| Hash | caf049d995cad12a860e1afbd2697addc22ae9014b635cc80e13b3f17e88aa32 |
| SimHash | f10059404f83 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /App_Code/ |
| Disallow | /bin/ |
| Disallow | /App_Data/ |
| Disallow | /Data/ |
| Disallow | /Handlers/ |
| Disallow | /Scripts/ |
| Disallow | /WebServices/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.careerguide.com/sitemap.xml |
| sitemap | https://www.careerguide.com/blog/sitemap-blog-amp.xml |
| sitemap | https://www.careerguide.com/blog/sitemap_index.xml |