latintimes.com
robots.txt

Robots Exclusion Standard data for latintimes.com

Resource Scan

Scan Details

Site Domain latintimes.com
Base Domain latintimes.com
Scan Status Ok
Last Scan 2024-04-23T11:59:54+00:00
Next Scan 2024-04-30T11:59:54+00:00

Last Scan

Scanned 2024-04-23T11:59:54+00:00
URL https://latintimes.com/robots.txt
Redirect https://www.latintimes.com/robots.txt
Redirect Domain www.latintimes.com
Redirect Base latintimes.com
Domain IPs 3.214.206.47, 3.229.37.183
Redirect IPs 3.214.206.47, 3.229.37.183
Response IP 3.214.206.47
Found Yes
Hash dd6c046f14f317413c911ccb6fb2030daface18a352046e1f8421c79a879e167
SimHash b30ba040e913
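
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so that is an assumption here. A minimal sketch of how such a digest could be computed and compared between scans (the function name `body_digest` is hypothetical):

```python
import hashlib

# Hash value as reported by the scan.
REPORTED_HASH = "dd6c046f14f317413c911ccb6fb2030daface18a352046e1f8421c79a879e167"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a raw robots.txt body.

    Assumption: the scanner hashes the unmodified response bytes with SHA-256.
    The report itself does not say which algorithm or normalization it uses.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """True when the newly fetched body differs from the previously hashed one."""
    return body_digest(body) != previous_digest
```

If the assumption holds, `has_changed(REPORTED_HASH, new_body)` would signal whether the file was modified since the 2024-04-23 scan. The SimHash field is a separate similarity fingerprint and is not covered by this sketch.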

Groups

*

Rule Path
Disallow /feeds/*
Disallow /system/
Disallow /rss/articles/specialcat/
Disallow /addineyeV2l
Disallow /eyeblaster
Disallow /doubleclick
Disallow /ads/sponsored
Disallow /comment/*
Disallow /search?q=*
Disallow /corporate/newsletter/signup?*
Disallow /*?utm_source=*
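
Several of the rules above use the `*` wildcard. As a rough illustration of how a crawler might evaluate them, the sketch below translates each rule into a regular expression and tests a path against the list. It assumes the common interpretation in which a rule matches from the start of the path, `*` matches any run of characters, and a trailing `$` anchors the end. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and because this group contains no Allow rules, no precedence logic is included.

```python
import re

# Disallow rules for the "*" group, copied from the table above.
DISALLOW_RULES = [
    "/feeds/*",
    "/system/",
    "/rss/articles/specialcat/",
    "/addineyeV2l",
    "/eyeblaster",
    "/doubleclick",
    "/ads/sponsored",
    "/comment/*",
    "/search?q=*",
    "/corporate/newsletter/signup?*",
    "/*?utm_source=*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    '*' becomes '.*'; a trailing '$' becomes an end-of-string anchor;
    everything else is matched literally.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + (r"\Z" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """True if any rule matches the path (path plus query string) from its start."""
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, `is_disallowed("/news/story?utm_source=twitter")` is true because of the `/*?utm_source=*` rule, while `is_disallowed("/news/story")` is false.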

Other Records

Field Value
sitemap https://www.latintimes.com/sitemap.xml
sitemap https://www.latintimes.com/googlenews.xml
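
The groups and other records listed in this report can be turned back into robots.txt syntax. The sketch below does that with a hypothetical `render_robots_txt` helper. The original file's ordering, comments, and whitespace are not recorded in the report, so the output is an approximation and would not be expected to reproduce the Hash value above.

```python
# Scan data transcribed from the report: one group ("*") and two sitemap records.
GROUPS = {
    "*": [
        ("Disallow", "/feeds/*"),
        ("Disallow", "/system/"),
        ("Disallow", "/rss/articles/specialcat/"),
        ("Disallow", "/addineyeV2l"),
        ("Disallow", "/eyeblaster"),
        ("Disallow", "/doubleclick"),
        ("Disallow", "/ads/sponsored"),
        ("Disallow", "/comment/*"),
        ("Disallow", "/search?q=*"),
        ("Disallow", "/corporate/newsletter/signup?*"),
        ("Disallow", "/*?utm_source=*"),
    ],
}
OTHER_RECORDS = [
    ("sitemap", "https://www.latintimes.com/sitemap.xml"),
    ("sitemap", "https://www.latintimes.com/googlenews.xml"),
]


def render_robots_txt(groups, other_records) -> str:
    """Serialize parsed groups and records into robots.txt text.

    Each group becomes a 'User-agent' line followed by its rules and a
    blank line; other records (such as sitemaps) are appended at the end.
    """
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{directive}: {path}" for directive, path in rules)
        lines.append("")
    lines.extend(f"{field.capitalize()}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"
```

Calling `render_robots_txt(GROUPS, OTHER_RECORDS)` yields a file that opens with `User-agent: *` and closes with the two `Sitemap:` lines.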