newsbite.it
robots.txt
Robots Exclusion Standard data for newsbite.it
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | newsbite.it |
| Base Domain | newsbite.it |
| Scan Status | Ok |
| Last Scan | 2026-02-10T05:36:22+00:00 |
| Next Scan | 2026-02-17T05:36:22+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-10T05:36:22+00:00 |
| URL | https://newsbite.it/robots.txt |
| Domain IPs | 104.21.78.39, 172.67.215.189, 2606:4700:3034::6815:4e27, 2606:4700:3035::ac43:d7bd |
| Response IP | 104.21.78.39 |
| Found | Yes |
| Hash | 35dd6e5f57e00d860ba48def4170f2c86c16c60d9dbab78f2bd0bbcd2b7d9c38 |
| SimHash | 46350953cd94 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Allow | / |
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Disallow | /readme.html |
| Disallow | /license.txt |
| Disallow | /wp-admin/admin-ajax.php |
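Both groups above target the same `*` user agent. Under RFC 9309, groups matching the same user agent are combined and the rule with the longest matching path wins, with Allow preferred on a tie. The following is a minimal Python sketch of that evaluation, assuming plain prefix matching (no `*` or `$` wildcards, since none of the listed paths use them). The names `RULES` and `is_allowed` are hypothetical and not part of the scan output.

```python
# Rules copied from the two `*` groups in this report, treated as one merged group.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/readme.html"),
    ("disallow", "/license.txt"),
    ("disallow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        # Longer prefix wins; on equal length an Allow rule is preferred.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


for p in ["/", "/wp-admin/", "/wp-admin/admin-ajax.php", "/readme.html", "/2026/02/story/"]:
    print(p, is_allowed(p))
```

With these rules, the `/wp-admin/admin-ajax.php` entry is already covered by the `/wp-admin/` entry. Parsers that apply rules in file order rather than by longest match may decide differently, because `Allow: /` appears first.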
Other Records
| Field | Value |
|---|---|
| sitemap | https://newsbite.it/sitemap.xml |
Warnings
- `content-signal` is not a known field.
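The warning above suggests that the scanner compares each field name against a list it recognizes. The following is a minimal Python sketch of such a check, not the scanner's actual logic. The `KNOWN_FIELDS` set, the `unknown_fields` helper, and the `SAMPLE` text are all assumptions: the report shows neither the raw file body nor the value of the `content-signal` field, so a placeholder value is used.

```python
# Field names this sketch treats as known; an assumption, not the scanner's real list.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(text, known=KNOWN_FIELDS):
    """Return the lowercased names of fields in `text` that are not in `known`."""
    found = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        if ":" not in line:
            continue  # blank or malformed line
        name = line.split(":", 1)[0].strip().lower()
        if name not in known and name not in found:
            found.append(name)
    return found


# Hypothetical input: the report does not show the file body or the field's value.
SAMPLE = """\
User-agent: *
Content-Signal: placeholder-value
Allow: /

User-agent: *
Disallow: /wp-admin/
Sitemap: https://newsbite.it/sitemap.xml
"""

print(unknown_fields(SAMPLE))
```

Field names are compared case-insensitively here, which matches how robots.txt field names are generally treated.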
Comments