capedcu.com
robots.txt
Robots Exclusion Standard data for capedcu.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | capedcu.com |
Base Domain | capedcu.com |
Scan Status | Ok |
Last Scan | 2024-05-19T00:11:57+00:00 |
Next Scan | 2024-06-18T00:11:57+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-19T00:11:57+00:00 |
URL | https://capedcu.com/robots.txt |
Domain IPs | 104.22.14.133, 104.22.15.133, 172.67.14.64, 2606:4700:10::6816:e85, 2606:4700:10::6816:f85, 2606:4700:10::ac43:e40 |
Response IP | 104.22.15.133 |
Found | Yes |
Hash | 84cdeef27ad3f516df73dc94c77a029171a3f367a7a407f5188130a013f5c4de |
SimHash | 014c78018f93 |
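The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say whether the body is normalized before hashing. Under that assumption, a minimal sketch for checking whether a fetched copy of the file still matches the recorded scan could look like this (`sha256_hex` and `matches_report` are hypothetical helper names):

```python
import hashlib

# Hash value copied from the scan record above.
REPORTED_HASH = "84cdeef27ad3f516df73dc94c77a029171a3f367a7a407f5188130a013f5c4de"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a body's digest with the reported hash.

    Assumption: the report hashes the unmodified response bytes with SHA-256.
    If the scanner uses another algorithm or normalizes the text first,
    this comparison will not match even for an unchanged file.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch therefore only suggests that the file changed since the scan on 2024-05-19; it could equally mean the assumption about the algorithm is wrong.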
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /learn/blog/category/ |
Disallow | /learn/blog/pg/ |
Disallow | /es/learn/blog/pg/ |
Disallow | /es/learn/blog/category/ |
Other Records
Field | Value |
---|---|
sitemap | https://capedcu.com/sitemap.xml |
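To show how a crawler would apply the group and sitemap records above, the sketch below rebuilds a robots.txt from the listed rules and queries it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption: the original file's exact line order, spacing, and comments are not shown in this report. `ExampleBot` and the sample URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables above.
# Assumption: the real file may differ in ordering, whitespace, or comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /learn/blog/category/
Disallow: /learn/blog/pg/
Disallow: /es/learn/blog/pg/
Disallow: /es/learn/blog/category/

Sitemap: https://capedcu.com/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Paginated and category blog listings are blocked for every user agent.
blocked = parser.can_fetch("ExampleBot", "https://capedcu.com/learn/blog/pg/2")

# The blog index itself is not covered by any Disallow rule.
allowed = parser.can_fetch("ExampleBot", "https://capedcu.com/learn/blog/")

# Sitemap URLs declared in the file (requires Python 3.8+).
sitemaps = parser.site_maps()
```

Here `blocked` is `False`, `allowed` is `True`, and `sitemaps` contains the single sitemap URL. Note that `urllib.robotparser` uses simple prefix matching, which is sufficient for these four rules because none of them uses wildcards.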