tgptruth.com
robots.txt

Robots Exclusion Standard data for tgptruth.com

Resource Scan

Scan Details

Site Domain tgptruth.com
Base Domain tgptruth.com
Scan Status Ok
Last Scan 2025-11-04T00:58:41+00:00
Next Scan 2025-12-04T00:58:41+00:00

Last Scan

Scanned 2025-11-04T00:58:41+00:00
URL https://tgptruth.com/robots.txt
Domain IPs 104.21.51.72, 172.67.176.243, 2606:4700:3035::6815:3348, 2606:4700:3036::ac43:b0f3
Response IP 104.21.51.72
Found Yes
Hash 7cca2d18a3b09b48fca2e975c4a458263e5c9cadaa8fe361b714701d8ae07086
SimHash 6c0a4a40a5f5
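
A content hash like the one above lets a scanner tell whether robots.txt changed between scans. The report does not name the hash function. The 64 hex digits are consistent with SHA-256, so the sketch below assumes SHA-256. The function names `body_digest` and `has_changed` are hypothetical, not part of the scanner.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body (assumed hash function)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hex: str) -> bool:
    """Compare the digest of the current body with the digest stored from the last scan."""
    return body_digest(body) != previous_hex.lower()
```

A caller would store the digest after each scan and pass it as `previous_hex` on the next one. Only byte-identical bodies yield the same digest, which is presumably why the report also lists a SimHash for near-duplicate detection.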

Groups

*

Rule Path
Disallow /*?p=*
Disallow /*%26p%3D*
Disallow /*?s=*
Disallow /*%26s%3D*
Disallow /*?ical=1
Disallow /*%26ical%3D1
Disallow /?author=*
Disallow /*wp-comments*
Disallow /*wp-trackback*
Disallow /*wp-feed*
Disallow /*replytocom%3D*
Disallow /*?preview=*
Disallow /*%26preview%3D*
Disallow /*add-to-cart%3D*
Disallow /*add_to_wishlist%3D*
Disallow /*cart/*
Disallow /*checkout/*
Disallow /*my-account/*
Disallow /*myaccount/*
Disallow /*?ajaxCalendar=1*
Allow /*/plugins/*

Other Records

Field Value
crawl-delay 1
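
The wildcard rules in this group can be checked with a small matcher. This is a minimal sketch, assuming the common interpretation where `*` matches any run of characters, a trailing `$` anchors the end, rules match as prefixes, the longest matching pattern wins, and Allow wins a tie. It covers only a few of the rules listed above, and the helper names `rule_to_regex` and `is_allowed` are hypothetical.

```python
import re


def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex that matches from the start."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' in place of '*'.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


# A subset of the rules from the "*" group above.
RULES = [
    ("disallow", "/*?p=*"),
    ("disallow", "/*?s=*"),
    ("disallow", "/*cart/*"),
    ("allow", "/*/plugins/*"),
]


def is_allowed(url_path: str, rules=RULES) -> bool:
    """Return True if url_path (path plus query string) may be crawled."""
    best = None  # (pattern length, is_allow); larger tuple wins
    for kind, pattern in rules:
        if rule_to_regex(pattern).match(url_path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

For example, `is_allowed("/?p=123")` is false because `/*?p=*` matches, while `is_allowed("/wp-content/plugins/x.js")` is true because only the Allow rule matches.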

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /
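
The per-agent groups above block the named crawlers from the whole site. A quick way to confirm how such groups are read is Python's standard `urllib.robotparser`. The sketch below parses a hand-written fragment reconstructed from this report (two named groups plus the final `*` group with an empty Disallow), not the live file, and the agent name `SomeOtherBot` is a made-up example. The standard-library parser does not implement `*` wildcards inside paths, so it suits only the plain `Disallow /` groups.

```python
from urllib.robotparser import RobotFileParser

# Fragment reconstructed from the report; not the complete robots.txt.
FRAGMENT = """\
User-agent: ccbot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: *
Disallow:
"""


def build_parser(text: str = FRAGMENT) -> RobotFileParser:
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def may_fetch(agent: str, url: str = "https://tgptruth.com/") -> bool:
    """Return True if the given user agent may fetch the URL under FRAGMENT."""
    return build_parser().can_fetch(agent, url)
```

Here `may_fetch("CCBot")` and `may_fetch("anthropic-ai")` return false, while an agent with no dedicated group, such as `may_fetch("SomeOtherBot")`, falls through to the `*` group and is allowed.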

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://tgptruth.com/sitemap_index.xml

Comments

  • Stop bots from crawling junk URLs
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK