corpbiz.io
robots.txt

Robots Exclusion Standard data for corpbiz.io

Resource Scan

Scan Details

Site Domain corpbiz.io
Base Domain corpbiz.io
Scan Status Ok
Last Scan2024-09-20T08:04:12+00:00
Next Scan 2024-10-20T08:04:12+00:00

Last Scan

Scanned2024-09-20T08:04:12+00:00
URL https://corpbiz.io/robots.txt
Domain IPs 104.26.4.226, 104.26.5.226, 172.67.73.226, 2606:4700:20::681a:4e2, 2606:4700:20::681a:5e2, 2606:4700:20::ac43:49e2
Response IP 104.26.4.226
Found Yes
Hash a63a5a281de5807ad664aca386e7ca744029a4e5f528bb24d310529c3e3be90d
SimHash e1046f53879b

Groups

*

Rule Path
Disallow /admin/
Disallow /lp/
Disallow /login
Disallow /reset-password
Disallow /test
Disallow /admin/style/images/userfiles/file/
Allow /admin/style/images/
Disallow /*/amp
Disallow /*/?nonamp=1
Disallow /*/?utm_
Disallow /*?utm_
Disallow /*?*

Other Records

Field Value
sitemap https://corpbiz.io/sitemap.xml
sitemap https://corpbiz.io/learning/sitemap_index.xml