transconnect.com
robots.txt

Robots Exclusion Standard data for transconnect.com

Resource Scan

Scan Details

Site Domain transconnect.com
Base Domain transconnect.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-08-21T04:12:32+00:00
Next Scan 2025-09-04T04:12:32+00:00

Last Successful Scan

Scanned 2025-07-14T02:22:17+00:00
URL https://transconnect.com/robots.txt
Domain IPs 46.19.218.211
Response IP 46.19.218.211
Found Yes
Hash 2243ef88c556bc4001450808b81afa8a2419fd25d845906dfa56355eb049de51
SimHash 79f25b6226d2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow */faq/*/
Disallow */ro/faq/*/
Disallow */nl/faq/*/
Disallow */fr/faq/*/
Disallow */de/faq/*/
Disallow */pl/faq/*/
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /xmlrpc.php
Disallow /cgi-bin/
Disallow /trackback/
Disallow /readme.html
Disallow /license.txt
Disallow */wp-json/
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /wp-content/plugins/
Disallow /*?s=*
Disallow /search/*
Disallow /page/*/?s=*
Disallow /*?srsltid=*
Disallow /*?limit*
Disallow /*?sort*
Disallow /*?route*
Disallow /*?utm*
Disallow /*?jet-engine=*
Disallow /*?gclid=*
Disallow /*?cw_s=*
Disallow /*?vacancy=*
Disallow /*?gad_source=*

claude-user
claudebot
chatgpt-user
gptbot
perplexity-user
perplexitybot

Rule Path
Allow /
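
How the Allow and Disallow rules in the "*" group interact can be sketched in a few lines of Python. This is a minimal sketch, not the scanner's or any crawler's actual implementation: it assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), and it assumes a leading "*" in a pattern such as */faq/*/ is treated as an ordinary wildcard, which individual crawlers may handle differently. The names RULES, pattern_to_regex, and is_allowed are hypothetical, and only a subset of the listed rules is included.

```python
import re

# A subset of the "*" group rules listed above, as (directive, pattern) pairs.
RULES = [
    ("Disallow", "/wp-admin/"),
    ("Allow", "/wp-admin/admin-ajax.php"),
    ("Disallow", "*/faq/*/"),
    ("Disallow", "/*?s=*"),
    ("Disallow", "/*?utm*"),
    ("Allow", "/wp-content/uploads/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the URL path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "Allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in ["/wp-admin/", "/wp-admin/admin-ajax.php",
                 "/nl/faq/shipping/", "/?s=test", "/about/"]:
        print(path, is_allowed(path))
```

Under these assumptions, /wp-admin/admin-ajax.php stays fetchable because its Allow pattern is longer than the /wp-admin/ Disallow pattern. Crawlers named in the second group (claudebot, gptbot, and so on) would normally follow only their own group, whose single rule is Allow /.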

Other Records

Field Value
sitemap https://www.transconnect.com/sitemap_index.xml

Comments

  • Handling FAQs
  • Additional rules
  • Allowing essential files
  • Handling Search
  • Handling Parameters & Filters
  • Allow agentic-AI users
  • Sitemap listing