corpbiz.io
robots.txt
Robots Exclusion Standard data for corpbiz.io
Resource Scan
Scan Details
Site Domain | corpbiz.io |
Base Domain | corpbiz.io |
Scan Status | Ok |
Last Scan | 2024-09-20T08:04:12+00:00 |
Next Scan | 2024-10-20T08:04:12+00:00 |
Last Scan
Scanned | 2024-09-20T08:04:12+00:00 |
URL | https://corpbiz.io/robots.txt |
Domain IPs | 104.26.4.226, 104.26.5.226, 172.67.73.226, 2606:4700:20::681a:4e2, 2606:4700:20::681a:5e2, 2606:4700:20::ac43:49e2 |
Response IP | 104.26.4.226 |
Found | Yes |
Hash | a63a5a281de5807ad664aca386e7ca744029a4e5f528bb24d310529c3e3be90d |
SimHash | e1046f53879b |
Groups
*
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /lp/ |
Disallow | /login |
Disallow | /reset-password |
Disallow | /test |
Disallow | /admin/style/images/userfiles/file/ |
Allow | /admin/style/images/ |
Disallow | /*/amp |
Disallow | /*/?nonamp=1 |
Disallow | /*/?utm_ |
Disallow | /*?utm_ |
Disallow | /*?* |
Other Records
Field | Value |
---|---|
sitemap | https://corpbiz.io/sitemap.xml |
sitemap | https://corpbiz.io/learning/sitemap_index.xml |