th4u.com
robots.txt

Robots Exclusion Standard data for th4u.com

Resource Scan

Scan Details

Site Domain th4u.com
Base Domain th4u.com
Scan Status Ok
Last Scan2025-09-30T04:05:24+00:00
Next Scan 2025-10-07T04:05:24+00:00

Last Scan

Scanned2025-09-30T04:05:24+00:00
URL https://th4u.com/robots.txt
Redirect https://www.th4u.com/robots.txt
Redirect Domain www.th4u.com
Redirect Base th4u.com
Domain IPs 104.21.15.106, 172.67.162.45, 2606:4700:3037::6815:f6a, 2606:4700:3037::ac43:a22d
Redirect IPs 104.21.15.106, 172.67.162.45, 2606:4700:3037::6815:f6a, 2606:4700:3037::ac43:a22d
Response IP 104.21.15.106
Found Yes
Hash 6e460bc2373df0c3959752156fa5f9decba39fd197bd36a66a966aead4fb2792
SimHash 4e00df156622

Groups

mediapartners-google

Rule Path
Allow /
Disallow /error_pages/
Disallow /guardian/
Disallow /soon.html

Other Records

Field Value
crawl-delay 10

googlebot

Rule Path
Allow /$
Disallow /cgi-bin/axs/
Disallow /CGIProxy2.1b15/
Disallow /cgiproxy/
Disallow /.well-known/
Disallow /travel/thailand_vacation_packages.htm
Disallow /th4u.php
Disallow /test-*.html
Disallow /soon.html
Disallow /error_pages/
Disallow /*?sort=
Disallow /*?filter=

*

Rule Path
Allow /$
Disallow /ueditor/
Disallow /editor/
Disallow /Scripts/
Disallow /cms/
Disallow /admin/
Disallow /App_Data/
Disallow /backup/
Disallow /controller.ashx
Disallow /th4u.php
Disallow /*.zip$
Disallow /*.rar$
Disallow /*.gz$
Disallow /error_pages/
Disallow /guardian/

*

Rule Path
Disallow /401.html
Disallow /403.html
Disallow /404.html
Disallow /500.html

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 7

Other Records

Field Value
sitemap https://www.th4u.com/sitemap.xml

Comments

  • TH4U Thailand for You - Enhanced Robots.txt
  • ========================
  • ADSENSE CRAWLER SETTINGS
  • ========================
  • ========================
  • SEARCH ENGINE CRAWLERS
  • ========================
  • ========================
  • GLOBAL RULES
  • ========================
  • ========================
  • ERROR PAGE HANDLING
  • ========================
  • ========================
  • CRAWL DELAY SETTINGS
  • ========================