earlyschool.net
robots.txt

Robots Exclusion Standard data for earlyschool.net

Resource Scan

Scan Details

Site Domain earlyschool.net
Base Domain earlyschool.net
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-05-06T14:57:20+00:00
Next Scan 2024-07-05T14:57:20+00:00

Last Successful Scan

Scanned 2024-02-14T10:36:40+00:00
URL https://earlyschool.net/robots.txt
Domain IPs 185.230.63.107, 185.230.63.171, 185.230.63.186
Response IP 185.230.63.107
Found Yes
Hash 736e7c20924e3db11ed5d8e110b0c09ca3d356e419f51de2d861e5bc730fbdf5
SimHash 68d68a62d616

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Disallow *?lightbox=

adsbot-google-mobile
adsbot-google

Rule Path
Disallow /_api/*
Disallow /_partials*
Disallow /pro-gallery-webapp/v1/galleries/*
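
The googlebot and adsbot rules above rely on the `*` wildcard, which RFC 9309 defines as matching any sequence of characters (with a trailing `$` anchoring the end of the path). A minimal sketch of that matching, assuming a hypothetical helper named `rule_matches` that is not part of this report or of any library, and that leaves out the longest-match precedence between Allow and Disallow:

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path (with query).

    Hypothetical helper for illustration only: '*' matches any run of
    characters and a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, so a pattern behaves as a prefix rule.
    return re.match(regex, path) is not None


# Example paths below are made up; only the patterns come from the report.
lightbox_blocked = rule_matches("*?lightbox=", "/gallery?lightbox=img1")
api_blocked = rule_matches("/_api/*", "/_api/v1/items")
about_blocked = rule_matches("/_api/*", "/about")
```

Under these assumptions, `lightbox_blocked` and `api_blocked` are true and `about_blocked` is false.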

petalbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.earlyschool.net/sitemap.xml
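
The groups and records listed above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from this report. The exact layout and casing of the original file are assumptions, as is attaching the crawl-delay record to the ahrefsbot group. To my knowledge the standard-library parser does plain prefix matching and does not expand `*` wildcards, so only the wildcard-free rules are queried here. `site_maps()` needs Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report; layout, casing, and the
# placement of Crawl-delay under AhrefsBot are assumptions.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: Googlebot
Disallow: *?lightbox=

User-agent: AdsBot-Google-Mobile
User-agent: AdsBot-Google
Disallow: /_api/*
Disallow: /_partials*
Disallow: /pro-gallery-webapp/v1/galleries/*

User-agent: PetalBot
Disallow: /

User-agent: AhrefsBot
Crawl-delay: 10

Sitemap: https://www.earlyschool.net/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# The page URL and "SomeOtherBot" are made-up examples.
page = "https://earlyschool.net/about"
petalbot_allowed = parser.can_fetch("PetalBot", page)     # blocked by "Disallow: /"
generic_allowed = parser.can_fetch("SomeOtherBot", page)  # falls back to the "*" group
ahrefs_delay = parser.crawl_delay("AhrefsBot")            # seconds between requests
sitemaps = parser.site_maps()                             # list of Sitemap URLs
```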

Comments

  • Optimization for Google Ads Bot
  • Block PetalBot
  • Crawl delay for overly enthusiastic bots
  • Auto generated, go to SEO Tools > Robots.txt Editor to change this