wbhrb.org
robots.txt
Robots Exclusion Standard data for wbhrb.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wbhrb.org |
Base Domain | wbhrb.org |
Scan Status | Ok |
Last Scan | 2024-09-25T13:02:31+00:00 |
Next Scan | 2024-10-02T13:02:31+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-25T13:02:31+00:00 |
URL | https://wbhrb.org/robots.txt |
Redirect | https://www.impekaedu.org/robots.txt |
Redirect Domain | www.impekaedu.org |
Redirect Base | impekaedu.org |
Domain IPs | 104.26.0.72, 104.26.1.72, 172.67.70.118, 2606:4700:20::681a:148, 2606:4700:20::681a:48, 2606:4700:20::ac43:4676 |
Redirect IPs | 104.21.74.171, 172.67.160.83, 2606:4700:3032::6815:4aab, 2606:4700:3033::ac43:a053 |
Response IP | 104.21.74.171 |
Found | Yes |
Hash | 972d8c1e543078724008ed4ac0983fe3d410707f6d473da034fb1a0b4fe52ae3 |
SimHash | ee17d030fd90 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-json/ |
Disallow | /?s=* |
Disallow | /search/* |
Disallow | /cdn-cgi/bm/cv/ |
Disallow | /cdn-cgi/challenge-platform/ |
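
To illustrate how the Disallow rules listed for this group might be applied to a request path, here is a minimal sketch. It assumes the common wildcard semantics where `*` matches any run of characters, a trailing `$` anchors the end, and every rule is otherwise a prefix match. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are hypothetical and not part of the scan data. Because the group contains no Allow rules, the sketch skips Allow/Disallow precedence, and individual crawlers may interpret the rules differently.

```python
import re

# Disallow paths copied from the group in the scan above.
DISALLOW = [
    "/wp-json/",
    "/?s=*",
    "/search/*",
    "/cdn-cgi/bm/cv/",
    "/cdn-cgi/challenge-platform/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters, a trailing
    '$' anchors the end of the URL, and everything else is literal.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    # re.Pattern.match anchors at the beginning, which gives prefix matching.
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/wp-json/wp/v2/posts", "/?s=test", "/search/foo", "/about/"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions a bare `/search` is not blocked, because the rule `/search/*` requires the trailing slash before the wildcard.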
Other Records
Field | Value |
---|---|
sitemap | https://www.wbhrb.in/sitemap_index.xml |