topdermal.com
robots.txt

Robots Exclusion Standard data for topdermal.com

Resource Scan

Scan Details

Site Domain topdermal.com
Base Domain topdermal.com
Scan Status Ok
Last Scan2025-12-03T11:48:30+00:00
Next Scan 2026-01-02T11:48:30+00:00

Last Scan

Scanned2025-12-03T11:48:30+00:00
URL https://topdermal.com/robots.txt
Domain IPs 178.211.133.105, 2a12:d280:100:96::1
Response IP 178.211.133.105
Found Yes
Hash 7e5b675950b7dd1d1945d62de4f7223146e00cce3a8f30181ad342d91bcb235c
SimHash 6338d85acc33

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /wp-includes/
Disallow /feed/
Disallow /trackback/
Disallow /attachment/
Disallow /author/
Disallow /xmlrpc.php
Disallow /?s=*
Disallow /?add-to-cart=*
Disallow /?attachment_id=*
Disallow /comments/feed/
Disallow /wp-content/uploads/
Disallow /readme.html
Disallow /checkout/
Disallow /my-account/
Disallow /cart/
Disallow /order-received/thank-you/

oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot

Rule Path
Allow /

gptbot
ccbot
google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://topdermal.com/sitemap_index.xml

Comments

  • Allow AI search and agent use
  • Disallow AI training data collection