cuatro.com
robots.txt

Robots Exclusion Standard data for cuatro.com

Resource Scan

Scan Details

Site Domain cuatro.com
Base Domain cuatro.com
Scan Status Ok
Last Scan 2024-05-24T19:34:25+00:00
Next Scan 2024-05-31T19:34:25+00:00

Last Scan

Scanned 2024-05-24T19:34:25+00:00
URL https://cuatro.com/robots.txt
Redirect https://www.cuatro.com/robots.txt
Redirect Domain www.cuatro.com
Redirect Base cuatro.com
Domain IPs 34.243.193.197, 34.251.122.225, 52.209.112.80
Redirect IPs 173.222.147.215
Response IP 23.202.142.39
Found Yes
Hash 63155c46541e3bf9067571f5487ef1d91cec5ae9b82c9892e5a1fd7f171599a9
SimHash 8b2bdc808277

Groups

*

Rule Path
Disallow /buscador/*?text=*
Disallow /api/cms/
Disallow /mdswebservice/
Disallow /mdsvideo/
Disallow /mdsads/
Disallow /stats.html
Disallow /api/boards
Disallow /tags/*?text=*
Disallow /autores/*?text=*
Disallow /personajes/*?text=*
Disallow /20d/
Disallow /20p/
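
Several of the rules above use `*` wildcards, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how a crawler might test a path against this group, assuming `*` matches any run of characters and a trailing `$` anchors the end of the path (the convention described in RFC 9309). The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and not part of the scan data.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/buscador/*?text=*",
    "/api/cms/",
    "/mdswebservice/",
    "/mdsvideo/",
    "/mdsads/",
    "/stats.html",
    "/api/boards",
    "/tags/*?text=*",
    "/autores/*?text=*",
    "/personajes/*?text=*",
    "/20d/",
    "/20p/",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query):
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path_and_query) for p in DISALLOW)
```

The path passed in should include the query string, since four of the patterns only apply when a `?text=` parameter is present. For example, `is_disallowed("/tags/madrid?text=foo")` is true under these assumptions, while `is_disallowed("/tags/madrid")` is false.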

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

facebookexternalhit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
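
The two Facebook groups define no path rules but do carry a `crawl-delay` of 10. A minimal sketch of reading that value with Python's standard `urllib.robotparser` follows. The report does not show the raw file, so `ROBOTS_FRAGMENT` is a hypothetical reconstruction of just these two groups, and `delay_for` is an illustrative helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the two groups as robots.txt text;
# the actual file layout is an assumption.
ROBOTS_FRAGMENT = """\
User-agent: facebookbot
Crawl-delay: 10

User-agent: facebookexternalhit
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())


def delay_for(agent):
    """Return the crawl delay for an agent, or None if none is set."""
    return parser.crawl_delay(agent)
```

With this fragment, `delay_for("facebookbot")` returns 10, and an agent that matches neither group gets `None`. Because neither group lists a Disallow rule, `parser.can_fetch("facebookbot", "/any/path")` is true.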

msnbot

Rule Path
Disallow *.shtml

yaanibot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cuatro.com/sitemap_index.xml
sitemap https://www.cuatro.com/sitemap_ampstories.xml
sitemap https://www.cuatro.com/sitemap_google_news.xml
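
The sitemap records above can be pulled out of a robots.txt file programmatically. A minimal sketch using `RobotFileParser.site_maps()` from the standard library (available in Python 3.8 and later) follows. The `SITEMAP_LINES` list is a hypothetical reconstruction of how these three records might appear in the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the sitemap lines; the URLs are the
# ones listed in the scan, the "Sitemap:" spelling is an assumption.
SITEMAP_LINES = [
    "Sitemap: https://www.cuatro.com/sitemap_index.xml",
    "Sitemap: https://www.cuatro.com/sitemap_ampstories.xml",
    "Sitemap: https://www.cuatro.com/sitemap_google_news.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap lines were found,
# so fall back to an empty list.
sitemaps = parser.site_maps() or []
```

Sitemap lines are independent of any user-agent group, so the parser collects them regardless of where they sit in the file.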