infoagro.com
robots.txt

Robots Exclusion Standard data for infoagro.com

Resource Scan

Scan Details

Site Domain infoagro.com
Base Domain infoagro.com
Scan Status Ok
Last Scan2024-10-06T16:25:26+00:00
Next Scan 2024-10-13T16:25:26+00:00

Last Scan

Scanned2024-10-06T16:25:26+00:00
URL https://infoagro.com/robots.txt
Domain IPs 217.76.143.127
Response IP 217.76.143.127
Found Yes
Hash 550c5cb317088950e6b8f0697a028223cfd3d41a59a58a6742e025370edd5533
SimHash 541dda658280

Groups

*

Rule Path
Disallow /compraventa/poner_anuncio.asp
Disallow /compraventa/poner_anuncio2.asp
Disallow /admin/
Disallow /empresas/incluir_empresa.asp
Disallow /empresas/mejorar_posicion.asp
Disallow /empresas/login.asp
Disallow /empresas/pwd_recovery.asp
Disallow /formacion/login.asp
Disallow /formacion/olvido.asp
Disallow /formacion/paypal_payment.asp
Disallow /formacion/paypal_confirmation.asp
Disallow /instrumentos_medida/makeorder.asp
Disallow /includes/js
Disallow /tools
Disallow /advertisment
Disallow /precios_origen/alhondigas/recordar_pwd.htm
Disallow /foro/register.asp
Disallow /foro/search.asp
Disallow /foro/profile.asp
Disallow /foro/sendtofriend_form.asp
Disallow /foro/sendcomment_form.asp
Disallow /galeria/enviar_foto.asp

blexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.infoagro.com/sitemap.xml
sitemap https://www.infoagro.com/sitemap_noticias.xml