tortoisemedia.com
robots.txt

Robots Exclusion Standard data for tortoisemedia.com

Resource Scan

Scan Details

Site Domain tortoisemedia.com
Base Domain tortoisemedia.com
Scan Status Ok
Last Scan2024-06-04T16:00:27+00:00
Next Scan 2024-06-18T16:00:27+00:00

Last Scan

Scanned2024-06-04T16:00:27+00:00
URL https://tortoisemedia.com/robots.txt
Redirect https://www.tortoisemedia.com/robots.txt
Redirect Domain www.tortoisemedia.com
Redirect Base tortoisemedia.com
Domain IPs 13.33.30.106, 13.33.30.121, 13.33.30.126, 13.33.30.36
Redirect IPs 151.101.131.42, 151.101.195.42, 151.101.3.42, 151.101.67.42
Response IP 199.232.47.42
Found Yes
Hash ce0e23aa687552ba14bca9af4c82a6665bc6fb01ba470c35eebdac46f72815b4
SimHash f0789b1a5737

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path Comment
Disallow / Used for many other (non-commercial) purposes as well

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

Comments

  • Used for many other (non-commercial) purposes as well
  • For new training only
  • Not for training, only for user requests
  • Marker for disabling Bard and Vertex AI
  • Speech synthesis only?
  • Multi-purpose, commercial uses; including LLMs
  • For new training only
  • Not for training, only for user requests
  • Marker for disabling Bard and Vertex AI
  • Speech synthesis only?
  • Multi-purpose, commercial uses; including LLMs