amp.diariodemallorca.es
robots.txt

Robots Exclusion Standard data for amp.diariodemallorca.es

Resource Scan

Scan Details

Site Domain amp.diariodemallorca.es
Base Domain diariodemallorca.es
Scan Status Ok
Last Scan 2024-06-26T01:07:28+00:00
Next Scan 2024-07-03T01:07:28+00:00

Last Scan

Scanned 2024-06-26T01:07:28+00:00
URL https://amp.diariodemallorca.es/robots.txt
Redirect https://www.diariodemallorca.es/robots.txt
Redirect Domain www.diariodemallorca.es
Redirect Base diariodemallorca.es
Domain IPs 199.232.194.133, 199.232.198.133
Redirect IPs 199.232.194.133, 199.232.198.133
Response IP 146.75.94.133
Found Yes
Hash 9930dde4a0dae0f4bdf297d2f6a81d4234981e1aa50df169ac038dc66d667f97
SimHash e8045190c331

Groups

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

google-extended

Rule Path
Allow /vida-y-estilo/
Allow /ocio/
Allow /sociedad/
Allow /economia/
Allow /viajes/
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Allow /
Allow /ocio/cine/cartelera/$
Disallow /ocio/cine/cartelera/
Disallow /motogp/
Disallow /hemeroteca/buscador/
Allow /ocio/hosteleria/*/mallorca_m/*_s/
Allow /ocio/hosteleria/*/mallorca_m/*_p/
Disallow /ocio/hosteleria/*/*_s/
Disallow /ocio/hosteleria/*/*_p/
Disallow /ocio/hosteleria/*/*_m/
Disallow /tags/letra/
Disallow /tour-francia/
Disallow /vuelta-espana/
Disallow /medio-ambiente/
Disallow /nacional/
Disallow /internacional/
Disallow /tiempo/

*

Rule Path
Disallow /*?p=
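
The groups above mix Allow and Disallow rules with the `*` and `$` pattern characters, so the outcome for a given path depends on which rule is considered the most specific. The following is a minimal sketch of one way to evaluate them, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and assuming the two `*` groups are combined into one, as that RFC requires for groups with the same user agent. The function names `pattern_to_regex` and `is_allowed` and the hosteleria test paths are hypothetical and chosen only for illustration; individual crawlers may behave differently.

```python
import re


def pattern_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: "*" matches any run of characters and a trailing "$"
    anchors the end of the path, as described in RFC 9309.
    """
    anchored = path_pattern.endswith("$")
    core = path_pattern[:-1] if anchored else path_pattern
    # Escape the literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` may be crawled under `rules`.

    `rules` is a list of (kind, pattern) tuples, kind being "allow" or
    "disallow". The longest matching pattern wins; on equal length,
    Allow wins. A path matched by no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path only.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


# A subset of the "*" rules listed above, with the second "*" group merged in.
STAR_RULES = [
    ("allow", "/"),
    ("allow", "/ocio/cine/cartelera/$"),
    ("disallow", "/ocio/cine/cartelera/"),
    ("disallow", "/motogp/"),
    ("allow", "/ocio/hosteleria/*/mallorca_m/*_s/"),
    ("disallow", "/ocio/hosteleria/*/*_s/"),
    ("disallow", "/*?p="),
]

# A subset of the google-extended rules listed above.
GOOGLE_EXTENDED_RULES = [
    ("allow", "/ocio/"),
    ("disallow", "/"),
]

if __name__ == "__main__":
    # The listing page itself is allowed; anything beneath it is not.
    print(is_allowed(STAR_RULES, "/ocio/cine/cartelera/"))
    print(is_allowed(STAR_RULES, "/ocio/cine/cartelera/palma/"))
    # google-extended may fetch /ocio/ but not other sections.
    print(is_allowed(GOOGLE_EXTENDED_RULES, "/ocio/agenda/"))
    print(is_allowed(GOOGLE_EXTENDED_RULES, "/deportes/"))
```

Under these assumptions, `Allow /ocio/cine/cartelera/$` outranks `Disallow /ocio/cine/cartelera/` only for the exact listing path, because the `$`-anchored pattern is one character longer but matches nothing beneath that path. Python's standard `urllib.robotparser` is not used here because it does not interpret `*` or `$` in paths.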

Other Records

Field Value
sitemap https://www.diariodemallorca.es/sitemap_index_f3750.xml
sitemap https://www.diariodemallorca.es/sitemap_google_news_f3750.xml