tlh.co.uk
robots.txt

Robots Exclusion Standard data for tlh.co.uk

Resource Scan

Scan Details

Site Domain tlh.co.uk
Base Domain tlh.co.uk
Scan Status Ok
Last Scan2025-05-07T19:37:55+00:00
Next Scan 2025-06-06T19:37:55+00:00

Last Scan

Scanned2025-05-07T19:37:55+00:00
URL https://tlh.co.uk/robots.txt
Redirect https://www.tlh.co.uk/robots.txt
Redirect Domain www.tlh.co.uk
Redirect Base tlh.co.uk
Domain IPs 104.26.8.153, 104.26.9.153, 172.67.69.158, 2606:4700:20::681a:899, 2606:4700:20::681a:999, 2606:4700:20::ac43:459e
Redirect IPs 104.26.8.153, 104.26.9.153, 172.67.69.158, 2606:4700:20::681a:899, 2606:4700:20::681a:999, 2606:4700:20::ac43:459e
Response IP 104.26.8.153
Found Yes
Hash cc1614ca346a55b493a846c0713fe703c033dde2692d49e2c85791dbcf479d2b
SimHash 60d05857edf1

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-content/languages/
Disallow /wp-content/plugins/
Disallow /wp-content/upgrade/
Disallow /wp-includes/
Disallow /readme.html
Disallow /refer/
Allow /wp-admin/admin-ajax.php?action=frmpro_css
Allow /wp-content/plugins/abwp_formidableforms_pro/js/
Allow /wp-content/plugins/formidable/js/

oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot

Rule Path
Allow /

gptbot
ccbot
google-extended

Rule Path
Disallow /

googlebot
bingbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.tlh.co.uk/sitemap_index.xml

Comments

  • Disallow access to admin areas for all bots
  • but make an exception for certain plugin files
  • Allow AI search and agent use
  • Disallow AI training data collection
  • Allow traditional search indexing