apc1040.com
robots.txt

Robots Exclusion Standard data for apc1040.com

Resource Scan

Scan Details

Site Domain apc1040.com
Base Domain apc1040.com
Scan Status Ok
Last Scan 2025-11-22T22:42:54+00:00
Next Scan 2025-12-22T22:42:54+00:00

Last Scan

Scanned 2025-11-22T22:42:54+00:00
URL https://apc1040.com/robots.txt
Domain IPs 160.153.0.94
Response IP 160.153.0.94
Found Yes
Hash aaf2926dc578f01cff622b3550bb5bd7694da2404786b965344f77916abd667f
SimHash 4f414850c1a3
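
The Hash value above is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not say which algorithm the scanner uses, so the following is only a minimal sketch of how a content hash could be used to detect changes between scans, assuming SHA-256 over the raw response body. The function name `fingerprint` and the sample bodies are hypothetical.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical bodies from two consecutive scans.
previous = fingerprint(b"User-agent: *\nDisallow: /wp-admin/\n")
current = fingerprint(b"User-agent: *\nDisallow: /wp-admin/\nDisallow: /feed/\n")

# Any byte-level difference produces a different digest.
changed = previous != current
```

An exact hash only reports that something changed. A similarity hash such as the SimHash listed above is typically used to judge how much changed, and is not reproduced here.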

Groups

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

yandex

Rule Path
Disallow

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /cgi-bin/
Disallow /feed/
Disallow /*/feed/
Disallow /comments/feed/
Disallow /category/*/feed/
Disallow /tag/*/feed/
Disallow /author/*/feed/
Disallow /?feed=
Allow /wp-admin/admin-ajax.php
Allow /*.css$
Allow /*.js$
Allow /*.png$
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.webp$
Allow /*.svg$
Allow /cart
Allow /checkout
Allow /order
Allow /order-received
Allow /order-confirmation
Allow /thank-you
Allow /payment
Allow /payment-confirmation
Allow /*/checkout?*
Allow /*/order?*
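
The `*` group mixes Disallow and Allow rules that can match the same URL (for example `/wp-admin/` and `/wp-admin/admin-ajax.php`). Under the longest-match convention described in RFC 9309, the most specific (longest) matching pattern wins, and Allow wins a tie. The sketch below illustrates that convention on a subset of the rules listed above. It is a simplified illustration, not the scanner's or any crawler's actual implementation: it ignores percent-encoding normalization, and the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Subset of the "*" group rules listed above: (directive, path pattern).
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/feed/"),
    ("disallow", "/*/feed/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*.css$"),
    ("allow", "/cart"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape everything, then turn the escaped "*" back into ".*".
    regex = re.escape(body).replace(r"\*", ".*")
    return re.compile("^" + regex + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if path may be fetched under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            # Longer pattern wins; on equal length, allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow

print(is_allowed("/wp-admin/"))                # False
print(is_allowed("/wp-admin/admin-ajax.php"))  # True
print(is_allowed("/blog/feed/"))               # False
print(is_allowed("/feed/style.css"))           # True
```

In the last call, `/*.css$` (7 characters) is longer than `/feed/` (6 characters), so the Allow rule takes precedence even though the path sits under a disallowed prefix.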

Other Records

Field Value
sitemap https://apc1040.com/sitemap_index.xml
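
To show how a crawler might select a group by user agent and read the sitemap record, here is a minimal sketch using Python's standard `urllib.robotparser` (`site_maps()` requires Python 3.8 or later). The `ROBOTS_TXT` text is a hand-written reconstruction of a few of the groups listed above, not the file as served, and the agent name `SomeOtherBot` is hypothetical. Note that this standard-library parser does simple prefix matching and does not interpret the `*` and `$` wildcards used in some of the rules.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumption): three of the groups plus the sitemap record.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: Googlebot
Allow: /

User-agent: *
Disallow: /wp-admin/

Sitemap: https://apc1040.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A named group applies to its own agent; "*" covers everyone else.
gptbot_ok = parser.can_fetch("GPTBot", "https://apc1040.com/")
googlebot_ok = parser.can_fetch("Googlebot", "https://apc1040.com/")
other_admin_ok = parser.can_fetch("SomeOtherBot", "https://apc1040.com/wp-admin/")
other_page_ok = parser.can_fetch("SomeOtherBot", "https://apc1040.com/about")

# Sitemap records are independent of any user-agent group.
sitemaps = parser.site_maps()
```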

Comments

  • robots.txt for apc1040.com
  • Purpose: Allow good bots, block abusive crawlers, disallow feeds and admin areas
  • Last updated: 12 Oct 2025
  • 1) Block known abusive / spammy bots
  • 2) Allow major search engines
  • 3) Global rules for all bots
  • Block admin, login, feeds, and internal scripts
  • Allow AJAX and all essential assets (CSS/JS/images)
  • 4) Explicitly allow transaction & conversion pages
  • 5) Sitemap location