spectrumnoticias.com
robots.txt
Robots Exclusion Standard data for spectrumnoticias.com
Resource Scan
Scan Details
| Site Domain | spectrumnoticias.com |
| Base Domain | spectrumnoticias.com |
| Scan Status | Ok |
| Last Scan | 2026-01-04T00:02:27+00:00 |
| Next Scan | 2026-01-11T00:02:27+00:00 |
Last Scan
| Scanned | 2026-01-04T00:02:27+00:00 |
| URL | https://spectrumnoticias.com/robots.txt |
| Domain IPs | 100.28.61.109, 18.213.203.208, 44.216.13.123, 52.21.98.4 |
| Response IP | 52.21.98.4 |
| Found | Yes |
| Hash | a9a5953c687b592f79fbcbc45499d11baba937adb4e4b200de7f20b8efd88936 |
| SimHash | 629e895ace94 |
Groups
*
| Rule | Path |
|---|---|
| Allow | /$ |
| Allow | /ny/nyc |
| Allow | /ny/nyc/* |
| Allow | /tx/texas |
| Allow | /tx/texas/* |
| Allow | /ca/los-angeles |
| Allow | /ca/los-angeles/* |
| Allow | /us/noticias |
| Allow | /us/noticias/* |
| Allow | /fl/florida |
| Allow | /fl/florida/* |
| Allow | /sitemap.xml |
| Allow | /services/* |
| Allow | /content/* |
| Allow | /etc/* |
| Allow | /.well-known/assetlinks.json |
| Allow | /local |
| Allow | /splash |
| Allow | /etc.clientlibs/* |
| Disallow | /* |
| Disallow | /*/*/partner-content/* |
| Disallow | /content/news/stories/* |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 1 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://spectrumnoticias.com/sitemap.xml |
Comments