amp.elperiodico.cat
robots.txt

Robots Exclusion Standard data for amp.elperiodico.cat

Resource Scan

Scan Details

Site Domain amp.elperiodico.cat
Base Domain elperiodico.cat
Scan Status Ok
Last Scan 2024-11-09T11:41:49+00:00
Next Scan 2024-11-16T11:41:49+00:00

Last Scan

Scanned 2024-11-09T11:41:49+00:00
URL https://amp.elperiodico.cat/robots.txt
Redirect https://www.elperiodico.cat/robots.txt
Redirect Domain www.elperiodico.cat
Redirect Base elperiodico.cat
Domain IPs 199.232.194.133, 199.232.198.133
Redirect IPs 199.232.194.133, 199.232.198.133
Response IP 151.101.42.133
Found Yes
Hash d4f466fcceb839f575ab8fbc57849e24f14775244c2aed8b4c75761106694bd9
SimHash f5b4cc442d32

Groups

twitterbot

Rule Path
Allow /

google-extended

Rule Path
Allow /vida-i-estil/
Allow /societat/
Allow /economia/
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /*/ext_resources/ads/
Disallow /*/ext_resources/portadas/
Disallow /*/noticias/
Disallow /*/component/
Disallow /*/buscador*
Disallow /*/blogscat/
Disallow /swf/
Disallow /component/
Disallow /onbcn/
Disallow /stats/
Disallow /blogs/blogs/
Disallow /comentar.asp/
Disallow /comunes/
Disallow /info/
Disallow /buscador/ca/
Disallow /buscador/es/
Disallow /blogscat/
Disallow /suep/livefyre/
Disallow /valorar.asp
Disallow /comentar.asp
Disallow /r.asp
Disallow /print.asp
Disallow /archivo_titulares.asp
Disallow /envio.asp
Disallow /foros.asp
Disallow /valorada.asp
Disallow /galerias.asp
Disallow /alminuto.asp
Disallow /videos2.asp
Disallow /verpdf.asp
Disallow /alta.asp
Disallow /UpdatedNewsElPeriodico.xml
Disallow /verd-i-blau/
Disallow /buscant-respostes/
Disallow /*/newsletters/
Allow /*/noticias/*.xml
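
The groups above mix Allow and Disallow rules, and several paths use the `*` wildcard, so the outcome for a given URL depends on how a crawler resolves overlapping rules. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). It is not the scanner's or any particular crawler's implementation. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and only a subset of the rules listed above is included.

```python
import re

# Subset of the rules listed above, keyed by lower-cased user agent.
# Illustrative only: the full "*" group has many more Disallow lines.
GROUPS = {
    "google-extended": [
        ("allow", "/vida-i-estil/"),
        ("allow", "/societat/"),
        ("allow", "/economia/"),
        ("disallow", "/"),
    ],
    "gptbot": [("disallow", "/")],
    "*": [
        ("disallow", "/*/noticias/"),
        ("allow", "/*/noticias/*.xml"),
        ("disallow", "/print.asp"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a path prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent, path):
    """Return True if `path` may be fetched by `agent` under GROUPS."""
    # Fall back to the "*" group when the agent has no group of its own.
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, Allow (True) wins.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


print(is_allowed("google-extended", "/societat/article"))
print(is_allowed("google-extended", "/esports/article"))
print(is_allowed("*", "/ca/noticias/feed.xml"))
print(is_allowed("*", "/ca/noticias/story.html"))
```

Under these assumptions, google-extended may fetch the three allowed sections but nothing else, and the final `Allow /*/noticias/*.xml` line re-opens XML files inside otherwise disallowed `/*/noticias/` paths because its pattern is longer. Python's standard `urllib.robotparser` does not expand `*` wildcards in paths, which is why a small hand-written matcher is used here.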