earlylearninghq.org.uk
robots.txt

Robots Exclusion Standard data for earlylearninghq.org.uk

Resource Scan

Scan Details

Site Domain earlylearninghq.org.uk
Base Domain earlylearninghq.org.uk
Scan Status Ok
Last Scan2025-10-11T12:16:00+00:00
Next Scan 2025-11-10T12:16:00+00:00

Last Scan

Scanned2025-10-11T12:16:00+00:00
URL https://earlylearninghq.org.uk/robots.txt
Domain IPs 104.26.12.123, 104.26.13.123, 172.67.70.61, 2606:4700:20::681a:c7b, 2606:4700:20::681a:d7b, 2606:4700:20::ac43:463d
Response IP 104.26.12.123
Found Yes
Hash fe3ef60ecf63a0c890a4740a6dd10300ba3365f05638edf35cfd091162536825
SimHash 5114cb71f275

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /category/*/*
Disallow */mailchimp-subscription/*
Disallow */newsletter-dismiss.php*
Disallow /*?doing_wp_cron=*
Allow /wp-content/uploads
Allow /pagead/js/adsbygoogle.js

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

Comments

  • Disallow: /trackback/
  • Disallow: /feed/
  • Disallow: /comments/
  • Disallow: */trackback/
  • Disallow: */feed/
  • Disallow: */comments/
  • Google Image
  • Google AdSense
  • Internet Archiver Wayback Machine
  • digg mirror