peachpitting.com
robots.txt

Robots Exclusion Standard data for peachpitting.com

Resource Scan

Scan Details

Site Domain peachpitting.com
Base Domain peachpitting.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-13T08:56:56+00:00
Next Scan 2025-10-27T08:56:56+00:00

Last Successful Scan

Scanned 2025-09-28T00:34:45+00:00
URL https://peachpitting.com/robots.txt
Domain IPs 104.21.12.244, 172.67.153.243, 2606:4700:3030::ac43:99f3, 2606:4700:3033::6815:cf4
Response IP 104.21.12.244
Found Yes
Hash d7631f757b6544787335a01529f86f256b9515376c925d216b921412e2c52b3b
SimHash f072db1a5737

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /cgi-bin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /archives/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow *?wptheme
Disallow /search?
Disallow /feeds
Disallow /wp-login.php
Disallow /novel/*
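
The Allow entry for /wp-admin/admin-ajax.php sits after a broader Disallow for /wp-admin/, so the outcome depends on how a crawler resolves overlapping rules. The following is a minimal sketch, not any crawler's actual implementation, of the longest-match precedence described in RFC 9309, applied to the rules listed for the * group. The names RULES, pattern_to_regex and is_allowed are hypothetical, and treating "*?wptheme" (which lacks a leading slash) as a valid wildcard pattern is an assumption; some parsers may reject it.

```python
import re

# Rules copied from the "*" group in the scan above.
RULES = [
    ("Disallow", "/wp-admin/"),
    ("Allow", "/wp-admin/admin-ajax.php"),
    ("Disallow", "/cgi-bin/"),
    ("Disallow", "/wp-content/"),
    ("Disallow", "/wp-includes/"),
    ("Disallow", "/archives/"),
    ("Disallow", "/comments/feed/"),
    ("Disallow", "/trackback/"),
    ("Disallow", "/index.php"),
    ("Disallow", "/xmlrpc.php"),
    ("Disallow", "*?wptheme"),
    ("Disallow", "/search?"),
    ("Disallow", "/feeds"),
    ("Disallow", "/wp-login.php"),
    ("Disallow", "/novel/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "Allow"
            # The longest pattern wins; on a tie, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for path in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/novel/chapter-1", "/"):
    print(path, "allowed" if is_allowed(path) else "blocked")
```

Under this precedence the order of lines within a group does not matter. A parser that applies the first matching rule instead would block /wp-admin/admin-ajax.php, because the Disallow for /wp-admin/ appears first.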

mediapartners-google

Rule Path
Allow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /
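
The named groups above can be checked with Python's standard urllib.robotparser module. This is a sketch under assumptions: the ROBOTS_TXT text is a reconstruction from the scan listing, not the original file, whose exact layout the report does not show, and the * group is left out. Only the whole-site Allow and Disallow groups are used because urllib.robotparser applies rules in file order rather than by longest match.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in the scan; the original file's
# exact formatting is an assumption.
ROBOTS_TXT = """\
User-agent: mediapartners-google
Allow: /

User-agent: ccbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: facebookbot
Disallow: /

User-agent: omgilibot
Disallow: /
"""

AGENTS = [
    "mediapartners-google",
    "ccbot",
    "gptbot",
    "chatgpt-user",
    "google-extended",
    "facebookbot",
    "omgilibot",
]

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Ask whether each agent may fetch the site root.
results = {
    agent: parser.can_fetch(agent, "https://peachpitting.com/")
    for agent in AGENTS
}

for agent, allowed in results.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Only mediapartners-google is reported as allowed; the other six agents are blocked from the whole site.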

Comments

  • Used for many other (non-commercial) purposes as well
  • For new training only
  • Not for training, only for user requests
  • Marker for disabling Bard and Vertex AI
  • Speech synthesis only?
  • Multi-purpose, commercial uses; including LLMs

Warnings

  • 1 invalid line.