cuatro.com
robots.txt
Robots Exclusion Standard data for cuatro.com
Resource Scan
Scan Details
Site Domain | cuatro.com |
Base Domain | cuatro.com |
Scan Status | Ok |
Last Scan | 2024-05-24T19:34:25+00:00 |
Next Scan | 2024-05-31T19:34:25+00:00 |
Last Scan
Scanned | 2024-05-24T19:34:25+00:00 |
URL | https://cuatro.com/robots.txt |
Redirect | https://www.cuatro.com/robots.txt |
Redirect Domain | www.cuatro.com |
Redirect Base | cuatro.com |
Domain IPs | 34.243.193.197, 34.251.122.225, 52.209.112.80 |
Redirect IPs | 173.222.147.215 |
Response IP | 23.202.142.39 |
Found | Yes |
Hash | 63155c46541e3bf9067571f5487ef1d91cec5ae9b82c9892e5a1fd7f171599a9 |
SimHash | 8b2bdc808277 |
Groups
*
Rule | Path |
---|---|
Disallow | /buscador/*?text=* |
Disallow | /api/cms/ |
Disallow | /mdswebservice/ |
Disallow | /mdsvideo/ |
Disallow | /mdsads/ |
Disallow | /stats.html |
Disallow | /api/boards |
Disallow | /tags/*?text=* |
Disallow | /autores/*?text=* |
Disallow | /personajes/*?text=* |
Disallow | /20d/ |
Disallow | /20p/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.cuatro.com/sitemap_index.xml |
sitemap | https://www.cuatro.com/sitemap_ampstories.xml |
sitemap | https://www.cuatro.com/sitemap_google_news.xml |