caterpillaruniversity.com
robots.txt

Robots Exclusion Standard data for caterpillaruniversity.com

Resource Scan

Scan Details

Site Domain caterpillaruniversity.com
Base Domain caterpillaruniversity.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2026-01-21T01:18:31+00:00
Next Scan 2026-04-21T01:18:31+00:00

Last Successful Scan

Scanned 2023-12-09T22:49:48+00:00
URL https://caterpillaruniversity.com/robots.txt
Domain IPs 23.236.62.147
Response IP 23.236.62.147
Found Yes
Hash aab05acb680e852b996f0989cd53aed4c299d0c2aeffc49d3c9c2a234b33cb2d
SimHash 48d28a42d656

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Disallow *?lightbox=

adsbot-google-mobile
adsbot-google

Rule Path
Disallow /_api/*
Disallow /_partials*
Disallow /pro-gallery-webapp/v1/galleries/*
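The googlebot and adsbot rules listed so far rely on the * wildcard. As a rough sketch of how such patterns are commonly evaluated (a prefix match against the URL path plus query string, where * matches any run of characters and a trailing $ anchors the end, as described in RFC 9309), the helper below may be useful. The function name rule_matches and the sample paths are hypothetical and are not taken from the scan; this is an illustration, not the logic any particular crawler is known to use.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path.

    Assumed semantics: prefix match, '*' matches any sequence of
    characters, and a trailing '$' anchors the match at the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece and join the pieces with '.*' for the wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start only, which gives prefix matching.
    return re.match(regex, path) is not None


if __name__ == "__main__":
    # Patterns copied from the report; the paths are made-up examples.
    samples = [
        ("*?lightbox=", "/gallery?lightbox=abc"),
        ("/_api/*", "/_api/v1/items"),
        ("/_partials*", "/_partials/header"),
        ("/pro-gallery-webapp/v1/galleries/*", "/about"),
    ]
    for pattern, path in samples:
        print(pattern, path, rule_matches(pattern, path))
```

Compiling each pattern to a regular expression keeps the sketch short; a crawler handling many rules might prefer a hand-written matcher instead.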

petalbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.caterpillaruniversity.com/sitemap.xml
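To check how a client might interpret these groups and records, the sketch below rebuilds a robots.txt from the fields in this report and loads it with Python's standard urllib.robotparser. The file text is a reconstruction: the original casing, ordering, and comment placement are not known, and attaching the crawl-delay record to the ahrefsbot group is an assumption based on where the report lists it. The page URL and the user agent "examplebot" are hypothetical. Note that urllib.robotparser does not implement * wildcards, so only the non-wildcard groups are exercised, and site_maps() needs Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; not a copy of the live file.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: googlebot
Disallow: *?lightbox=

User-agent: adsbot-google-mobile
User-agent: adsbot-google
Disallow: /_api/*
Disallow: /_partials*
Disallow: /pro-gallery-webapp/v1/galleries/*

User-agent: petalbot
Disallow: /

User-agent: ahrefsbot
Crawl-delay: 10

Sitemap: https://www.caterpillaruniversity.com/sitemap.xml
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)
page = "https://caterpillaruniversity.com/courses"  # hypothetical URL

print("petalbot allowed:  ", rules.can_fetch("petalbot", page))
print("ahrefsbot allowed: ", rules.can_fetch("ahrefsbot", page))
print("examplebot allowed:", rules.can_fetch("examplebot", page))
print("ahrefsbot delay:   ", rules.crawl_delay("ahrefsbot"))
print("sitemaps:          ", rules.site_maps())
```

Parsing from a string rather than calling read() avoids fetching the live site, which matters here because the most recent scan could not connect to the server.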

Comments

  • Optimization for Google Ads Bot
  • Block PetalBot
  • Crawl delay for overly enthusiastic bots
  • Auto generated, go to SEO Tools > Robots.txt Editor to change this