sirtaptap.com
robots.txt

Robots Exclusion Standard data for sirtaptap.com

Resource Scan

Scan Details

Site Domain sirtaptap.com
Base Domain sirtaptap.com
Scan Status Ok
Last Scan2024-05-30T00:04:24+00:00
Next Scan 2024-06-06T00:04:24+00:00

Last Scan

Scanned2024-05-30T00:04:24+00:00
URL https://sirtaptap.com/robots.txt
Domain IPs 104.26.14.178, 104.26.15.178, 172.67.74.183, 2606:4700:20::681a:eb2, 2606:4700:20::681a:fb2, 2606:4700:20::ac43:4ab7
Response IP 172.67.74.183
Found Yes
Hash a706c5709394cd8e1924f654e79765ae5681b37d69c101e527ca5c35a9bcd60d
SimHash d0389b1a5b37

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /humix/
Allow /wp-admin/admin-ajax.php

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sertaptap.com/sitemap_index.xml

Comments

  • AI bot blocking section
  • Used for many other (non-commercial) purposes as well
  • For new training only
  • Not for training, only for user requests
  • Marker for disabling Bard and Vertex AI
  • Speech synthesis only?
  • Multi-purpose, commercial uses; including LLMs