trplus.com.tw
robots.txt

Robots Exclusion Standard data for trplus.com.tw

Resource Scan

Scan Details

Site Domain trplus.com.tw
Base Domain trplus.com.tw
Scan Status Ok
Last Scan4/10/2025, 6:40:56 AM
Next Scan 4/24/2025, 6:40:56 AM

Last Scan

Scanned4/10/2025, 6:40:56 AM
URL https://www.trplus.com.tw/robots.txt
Domain IPs 23.46.230.134, 23.46.230.158
Response IP 23.45.207.168
Found Yes
Hash a757e7b921070209280bb8a0142c82357d1989c10026bc89ddf2861f7ef7787a
SimHash f844d61cedf0

Groups

googlebot-image

Rule Path
Allow /p/
Allow /_ui/pages/sitemap/

*

Product Comment
* For all robots
Rule Path
Allow /
Disallow /_ui/edm/
Disallow /_ui/event/
Disallow /*?q=
Disallow /QRCode/

cazoodlebot

Product Comment
cazoodlebot Block CazoodleBot as it does not present correct accept content headers
Rule Path
Disallow /

mj12bot

Product Comment
mj12bot Block MJ12bot as it is just noise
Rule Path
Disallow /

dotbot/1.0

Product Comment
dotbot/1.0 Block dotbot as it cannot parse base urls properly
Rule Path
Disallow /

gigabot

Product Comment
gigabot Block Gigabot
Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.trplus.com.tw/_ui/marketing/sitemap/sitemap.xml

Comments

  • Request-rate: 1/10 # maximum rate is one page every 10 seconds
  • Crawl-delay: 10 # 10 seconds between page requests
  • Visit-time: 0200-0845 # only visit between 04:00 and 08:45 UTC