thinktq.com
robots.txt

Robots Exclusion Standard data for thinktq.com

Resource Scan

Scan Details

Site Domain thinktq.com
Base Domain thinktq.com
Scan Status Ok
Last Scan 2025-10-22T21:12:32+00:00
Next Scan 2025-11-21T21:12:32+00:00

Last Scan

Scanned 2025-10-22T21:12:32+00:00
URL https://thinktq.com/robots.txt
Domain IPs 104.26.12.160, 104.26.13.160, 172.67.73.92, 2606:4700:20::681a:ca0, 2606:4700:20::681a:da0, 2606:4700:20::ac43:495c
Response IP 172.67.73.92
Found Yes
Hash 8a5c02dccfaf6ec1063ed61bc6825f34b85c856f82e54fece988697a71a911be
SimHash 7b08f31aff43
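
The report does not say which algorithm produced the Hash value. Its 64 hexadecimal characters are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body; that is an assumption, not something the scan states. The helper names (sha256_hex, matches_report) are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "8a5c02dccfaf6ec1063ed61bc6825f34b85c856f82e54fece988697a71a911be"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Compare a fetched robots.txt body against the reported hash.

    Assumption: the report hashes the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == REPORTED_HASH
```

If a freshly fetched copy of https://thinktq.com/robots.txt does not match, either the file changed since the scan or the assumption about the hashing scheme is wrong.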

Groups

twitterbot

Rule Path
Allow /training/

dittospyder
googlebot-image
vscooter

Rule Path
Disallow /

googlebot

Rule Path
Disallow /*.gif$
Disallow /*.jpg$
Disallow /*.jpeg$
Disallow /*.png$
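
The googlebot rules above rely on two special characters from the Robots Exclusion Protocol: `*` matches any run of characters and a trailing `$` anchors the pattern to the end of the URL path. A minimal sketch of how such patterns could be evaluated follows; the function names are hypothetical and the example paths are illustrative, not taken from the site.

```python
import re

# Disallow patterns from the googlebot group above.
GOOGLEBOT_DISALLOW = ["/*.gif$", "/*.jpg$", "/*.jpeg$", "/*.png$"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(path: str) -> bool:
    """True if any googlebot Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in GOOGLEBOT_DISALLOW)
```

Under this reading, a path such as /images/logo.png is blocked, while /images/logo.png?v=2 is not, because the `$` anchor requires the path to end in the image extension.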

*

Rule Path
Disallow /ads/
Disallow /assets/
Disallow /common/
Disallow /documentation/
Disallow /downloads/
Disallow /gift/
Disallow /mytq/
Disallow /images/
Disallow /membership/
Disallow /order/
Disallow /private/
Disallow /redeem/
Disallow /scripts/
Disallow /template/
Disallow /testing/
Disallow /xml/
Allow /index.cfm
Allow /about/
Allow /blogs/
Allow /careerpower/
Allow /mytq/index.cfm
Allow /mytq/tqs_introduction.cfm
Allow /pda/
Allow /products/
Allow /training/
Allow /tq/
Allow /welcome/
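
The `*` group disallows /mytq/ but allows /mytq/index.cfm and /mytq/tqs_introduction.cfm. Under the longest-match precedence described in RFC 9309, the more specific (longer) rule wins, and Allow wins a tie. The sketch below applies that precedence to a subset of the rules above; it handles plain prefixes only, and both is_allowed and the example path /mytq/settings.cfm are hypothetical.

```python
# A subset of the '*' group, as (verdict, path prefix) pairs.
RULES = [
    ("disallow", "/mytq/"),
    ("disallow", "/private/"),
    ("allow", "/mytq/index.cfm"),
    ("allow", "/mytq/tqs_introduction.cfm"),
    ("allow", "/training/"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; paths with no matching rule are allowed."""
    best_len = -1
    allowed = True
    for verdict, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_prefers_allow = len(prefix) == best_len and verdict == "allow"
        if longer or tie_prefers_allow:
            best_len = len(prefix)
            allowed = verdict == "allow"
    return allowed
```

With these rules, /mytq/index.cfm is allowed because its Allow rule is longer than Disallow /mytq/, while another path under /mytq/ such as /mytq/settings.cfm stays disallowed. Parsers that apply rules in file order instead of by length may reach a different verdict for the same file.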