catch-newz.com
robots.txt
Robots Exclusion Standard data for catch-newz.com
Resource Scan
Scan Details
Site Domain | catch-newz.com |
Base Domain | catch-newz.com |
Scan Status | Ok |
Last Scan | 2024-09-25T10:54:10+00:00 |
Next Scan | 2024-10-02T10:54:10+00:00 |
Last Scan
Scanned | 2024-09-25T10:54:10+00:00 |
URL | https://catch-newz.com/robots.txt |
Domain IPs | 146.88.233.252 |
Response IP | 146.88.233.252 |
Found | Yes |
Hash | aef6be027d924db4f974757ee35b6c02abc74dd34c969daac867926b69b9abfb |
SimHash | e21e155943f5 |
Groups
*
Rule | Path |
---|---|
Allow | /*.js* |
Allow | /*.css* |
Allow | /*.png* |
Allow | /*.jpg* |
Allow | /*.gif* |
Disallow | /administrator/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /cli/ |
Disallow | /includes/ |
Disallow | /installation/ |
Disallow | /language/ |
Disallow | /layouts/ |
Disallow | /libraries/ |
Disallow | /logs/ |
Disallow | /tmp/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.catch-newz.com/index.php?option=com_jmap&view=sitemap&format=xml |
Comments