deagostini.com
robots.txt

Robots Exclusion Standard data for deagostini.com

Resource Scan

Scan Details

Site Domain deagostini.com
Base Domain deagostini.com
Scan Status Ok
Last Scan2025-03-05T05:42:13+00:00
Next Scan 2025-04-04T05:42:13+00:00

Last Scan

Scanned2025-03-05T05:42:13+00:00
URL https://deagostini.com/robots.txt
Redirect https://www.deagostini.com/robots.txt
Redirect Domain www.deagostini.com
Redirect Base deagostini.com
Domain IPs 52.16.150.218
Redirect IPs 2600:9000:2894:3400:1e:ea63:adc0:93a1, 2600:9000:2894:9600:1e:ea63:adc0:93a1, 2600:9000:2894:9e00:1e:ea63:adc0:93a1, 2600:9000:2894:ba00:1e:ea63:adc0:93a1, 2600:9000:2894:be00:1e:ea63:adc0:93a1, 2600:9000:2894:c400:1e:ea63:adc0:93a1, 2600:9000:2894:ce00:1e:ea63:adc0:93a1, 2600:9000:2894:f000:1e:ea63:adc0:93a1, 3.170.229.109, 3.170.229.23, 3.170.229.62, 3.170.229.64
Response IP 3.170.229.62
Found Yes
Hash bbaf904763b587bd523c1574b1cf162ff607fdf6c4368363f4b7036ed79fb39f
SimHash 41503f454472

Groups

*

Rule Path
Allow /
Disallow /first-issue/*
Disallow /blog/*

adsbot-google

Rule Path
Disallow

googlebot

Rule Path
Allow */myarea/*.js
Allow */myarea/*.css
Disallow */myarea/*
Disallow /*?

googlebot-image

Rule Path
Disallow

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.deagostini.com/sitemap-index.xml

Comments

  • Prevents locked resources for Adsbot-Google
  • Prevents locked resources for Chat GPT
  • Indice de sitemaps