deltasimons.com
robots.txt

Robots Exclusion Standard data for deltasimons.com

Resource Scan

Scan Details

Site Domain deltasimons.com
Base Domain deltasimons.com
Scan Status Ok
Last Scan 2024-09-17T05:26:28+00:00
Next Scan 2024-10-17T05:26:28+00:00

Last Scan

Scanned 2024-09-17T05:26:28+00:00
URL https://deltasimons.com/robots.txt
Domain IPs 165.22.117.132
Response IP 165.22.117.132
Found Yes
Hash 889125289369d37be3501799f88402601c656736af05ef8f5172e10ed7716598
SimHash afef4c50f4a2

Groups

*
adsbot-google

Rule Path
Disallow /wp-json/
Disallow /?rest_route=
Disallow /wp-admin/
Disallow /wp-content/cache/
Disallow /wp-content/plugins/
Disallow /wp-login.php
Disallow /xmlrpc.php

*
adsbot-google

Rule Path
Disallow /wp-includes/
Allow /wp-includes/css/
Allow /wp-includes/js/
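
The group above mixes a broad Disallow with two narrower Allow rules. Under the Robots Exclusion Protocol (RFC 9309), the most specific (longest) matching rule wins, and an Allow wins a tie against a Disallow of equal length. The following is a minimal sketch of that precedence for plain prefix rules only; the function name `is_allowed` and the sample paths are illustrative assumptions, not part of the scan data.

```python
# Rules copied from the /wp-includes/ group in this report.
RULES = [
    ("disallow", "/wp-includes/"),
    ("allow", "/wp-includes/css/"),
    ("allow", "/wp-includes/js/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Hypothetical helper: handles literal prefixes only, no wildcards.
    """
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        # A longer prefix wins; on equal length, Allow beats Disallow.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


# Example paths (assumed, for illustration only).
print(is_allowed("/wp-includes/css/dashicons.min.css"))
print(is_allowed("/wp-includes/version.php"))
```

With these rules, stylesheets and scripts under /wp-includes/ stay crawlable while the rest of the directory is blocked.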

*
adsbot-google

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/

adsbot-google

Rule Path
Disallow /*.woff2
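
Two of the paths listed so far, `/page/*/?s=` and `/*.woff2`, use the `*` wildcard, which RFC 9309 defines as matching any sequence of characters; a trailing `$` would anchor a pattern to the end of the URL. The sketch below shows one way to test such patterns by translating them to regular expressions. The helper names `pattern_to_regex` and `matches`, and the sample URLs, are assumptions made for illustration.

```python
import re


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Hypothetical helper: `*` becomes `.*`, a trailing `$` anchors the end,
    and every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def matches(pattern, path):
    """Return True if `path` (path plus query string) matches `pattern`."""
    # re.match anchors at the start, which gives prefix semantics.
    return pattern_to_regex(pattern).match(path) is not None


# Example URLs (assumed, for illustration only).
print(matches("/*.woff2", "/wp-content/themes/site/font.woff2"))
print(matches("/page/*/?s=", "/page/2/?s=tax"))
```

Because `/*.woff2` has no trailing `$`, it behaves as a prefix match and would also cover a font URL followed by a query string.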

*
adsbot-google

Rule Path
Disallow /wp-content/uploads/complianz/
Disallow /?wp-ajax=

*
adsbot-google

Rule Path
Disallow /cdn-cgi/bm/cv/
Disallow /cdn-cgi/challenge-platform/
Disallow /cdn-cgi/images/trace/
Disallow /cdn-cgi/rum
Disallow /cdn-cgi/scripts/
Disallow /cdn-cgi/styles/
Disallow /cdn-cgi/zaraz/

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler
mj12bot
ahrefsbot

Rule Path
Disallow /
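
The group above applies `Disallow /` to every listed agent, so a crawler that recognizes its own token there should fetch nothing. As a rough sketch of how that check could look, the snippet below does a case-insensitive substring comparison against the tokens from this report. This is a simplification and an assumption: real crawlers compare their own product token rather than a full User-Agent header, and the name `is_banned` is hypothetical.

```python
# Agent tokens copied from the blocked group in this report.
BLOCKED_TOKENS = [
    "nuclei", "wikido", "riddler", "petalbot", "zoominfobot",
    "go-http-client", "node/simplecrawler", "cazoodlebot", "dotbot/1.0",
    "gigabot", "barkrowler", "blexbot", "magpie-crawler", "mj12bot",
    "ahrefsbot",
]


def is_banned(user_agent):
    """Return True if any blocked token appears in `user_agent`.

    Hypothetical helper: substring matching is a simplification of the
    product-token comparison a real crawler would perform.
    """
    ua = user_agent.lower()  # group matching is case-insensitive
    return any(token in ua for token in BLOCKED_TOKENS)


# Example User-Agent strings (assumed, for illustration only).
print(is_banned("Mozilla/5.0 (compatible; AhrefsBot/7.0; +http://ahrefs.com/robot/)"))
print(is_banned("Mozilla/5.0 (compatible; Googlebot/2.1)"))
```

Compliance is voluntary: robots.txt only tells cooperating crawlers what to skip and does not enforce the ban.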

Other Records

Field Value
sitemap https://www.luciongroup.com/sitemap_index.xml

Comments

  • Block some general WP endpoints
  • Special handling for /wp-includes/
  • Block internal search
  • Adsbot doesn't ever need to crawl fonts
  • Block leaky plugins
  • Block leaky Cloudflare endpoints
  • Ban noisy bots
  • Sitemap