cbm.org
robots.txt
Robots Exclusion Standard data for cbm.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cbm.org |
Base Domain | cbm.org |
Scan Status | Ok |
Last Scan | 2024-09-30T14:19:13+00:00 |
Next Scan | 2024-10-30T14:19:13+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-30T14:19:13+00:00 |
URL | https://cbm.org/robots.txt |
Redirect | https://www.cbm.org/robots.txt |
Redirect Domain | www.cbm.org |
Redirect Base | cbm.org |
Domain IPs | 168.119.144.203 |
Redirect IPs | 168.119.144.203 |
Response IP | 168.119.144.203 |
Found | Yes |
Hash | 3b8efabe988d46cc97d896a31875de41cddf0a9f529c4bb5f124b7b1959d53f6 |
SimHash | 76db72860d26 |
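
The Hash value above is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not say which algorithm the scanner uses, so the following is only a sketch, assuming SHA-256 over the raw response body. The function names `body_fingerprint` and `has_changed` are hypothetical and show how such a digest could be used to detect changes between scans.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of the raw robots.txt bytes.

    Assumption: the scanner hashes the unmodified response body.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return body_fingerprint(body) != previous_hash.lower()
```

An exact digest changes on any byte difference, while a similarity hash such as the SimHash field is meant to stay close for near-identical content.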
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /*?id=* |
Disallow | /*%26id%3D* |
Disallow | /*/Private/* |
Disallow | /*/Configuration/* |
Disallow | /typo3temp/* |
Allow | /typo3temp/*.css |
Allow | /typo3temp/*.css.*.gzip |
Allow | /typo3temp/*.js |
Allow | /typo3temp/*.js.*.gzip |
Allow | /typo3temp/*.jpg |
Allow | /typo3temp/*.gif |
Allow | /typo3temp/*.png |
Disallow | *.sql |
Disallow | *.sql.gz |
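
The Allow and Disallow rules above overlap, so the listed order alone does not settle which one applies to a given URL. Below is a minimal sketch of the longest-match evaluation described in RFC 9309, where `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative, and the sketch skips the percent-encoding normalization a full implementation would perform.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/*?id=*"),
    ("disallow", "/*%26id%3D*"),
    ("disallow", "/*/Private/*"),
    ("disallow", "/*/Configuration/*"),
    ("disallow", "/typo3temp/*"),
    ("allow", "/typo3temp/*.css"),
    ("allow", "/typo3temp/*.css.*.gzip"),
    ("allow", "/typo3temp/*.js"),
    ("allow", "/typo3temp/*.js.*.gzip"),
    ("allow", "/typo3temp/*.jpg"),
    ("allow", "/typo3temp/*.gif"),
    ("allow", "/typo3temp/*.png"),
    ("disallow", "*.sql"),
    ("disallow", "*.sql.gz"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing '$' anchors the pattern at the end of the URL path.
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path (path plus query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, path in rules:
        # re.match anchors at the start of url_path only.
        if pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            # Longest pattern wins; on equal length, Allow wins.
            if len(path) > best_len or (len(path) == best_len and allow):
                best_len, best_allow = len(path), allow
    return best_allow
```

Under this reading, `is_allowed("/typo3temp/assets/style.css")` is true because `/typo3temp/*.css` is longer than `/typo3temp/*`, while `is_allowed("/index.php?id=42")` is false because `/*?id=*` is longer than `/`.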
Other Records
Field | Value |
---|---|
sitemap | https://www.cbm.org/sitemap.xml |
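
A crawler would normally read the sitemap record from the file itself. The sketch below uses Python's standard `urllib.robotparser` (the `site_maps()` method requires Python 3.8 or later) on a partial reconstruction of the file. The `ROBOTS_LINES` list is an assumption rebuilt from this report, not the verbatim file. The standard-library parser does not implement `*` wildcards, so it is used here only to extract the sitemap URL.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the report above; not the verbatim file.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /",
    "Sitemap: https://www.cbm.org/sitemap.xml",
]


def sitemap_urls(lines):
    """Return the sitemap URLs declared in robots.txt lines (may be empty)."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap record is present.
    return parser.site_maps() or []
```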