trplus.com.tw
robots.txt

Robots Exclusion Standard data for trplus.com.tw

Resource Scan

Scan Details

Site Domain	trplus.com.tw
Base Domain	trplus.com.tw
Scan Status	Ok
Last Scan	4/10/2025, 6:40:56 AM
Next Scan	4/24/2025, 6:40:56 AM

Last Scan

Scanned	4/10/2025, 6:40:56 AM
URL	https://www.trplus.com.tw/robots.txt
Domain IPs	23.46.230.134, 23.46.230.158
Response IP	23.45.207.168
Found	Yes
Hash	a757e7b921070209280bb8a0142c82357d1989c10026bc89ddf2861f7ef7787a
SimHash	f844d61cedf0
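
The Hash field above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the scan report does not name the algorithm. Under that assumption, a minimal sketch of how a fetched copy of the file could be compared against the recorded value might look like this (the function names `sha256_hex` and `has_changed` are hypothetical, not part of the scan tool):

```python
import hashlib

# Hash value recorded in the scan details above.
RECORDED_HASH = "a757e7b921070209280bb8a0142c82357d1989c10026bc89ddf2861f7ef7787a"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded_hash: str = RECORDED_HASH) -> bool:
    """True if the body's digest differs from the recorded hash.

    Assumption: the report hashes the raw bytes of robots.txt with SHA-256.
    If the tool normalizes the content first, the digests will not match.
    """
    return sha256_hex(body) != recorded_hash.lower()
```

An exact digest only detects that something changed; the shorter SimHash field presumably serves to measure how similar two versions are, which this sketch does not attempt.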

Groups

googlebot-image

Rule	Path
Allow	/p/
Allow	/_ui/pages/sitemap/

*

Product	Comment
*	For all robots

Rule	Path
Allow	/
Disallow	/_ui/edm/
Disallow	/_ui/event/
Disallow	/*?q=
Disallow	/QRCode/

cazoodlebot

Product	Comment
cazoodlebot	Block CazoodleBot as it does not present correct accept content headers

Rule	Path
Disallow	/

mj12bot

Product	Comment
mj12bot	Block MJ12bot as it is just noise

Rule	Path
Disallow	/

dotbot/1.0

Product	Comment
dotbot/1.0	Block dotbot as it cannot parse base urls properly

Rule	Path
Disallow	/

gigabot

Product	Comment
gigabot	Block Gigabot

Rule	Path
Disallow	/
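
To make the effect of the groups above concrete, the following is a minimal sketch of how a crawler might evaluate them. It assumes the longest-match semantics described in RFC 9309 (the most specific matching rule wins, ties go to Allow, and `*` in a path is a wildcard). The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the product-token lookup is simplified to an exact, case-insensitive match.

```python
import re

# Groups transcribed from the scan tables: product token -> (rule, path) pairs.
GROUPS = {
    "googlebot-image": [("allow", "/p/"), ("allow", "/_ui/pages/sitemap/")],
    "*": [
        ("allow", "/"),
        ("disallow", "/_ui/edm/"),
        ("disallow", "/_ui/event/"),
        ("disallow", "/*?q="),
        ("disallow", "/QRCode/"),
    ],
    "cazoodlebot": [("disallow", "/")],
    "mj12bot": [("disallow", "/")],
    "dotbot/1.0": [("disallow", "/")],
    "gigabot": [("disallow", "/")],
}


def pattern_to_regex(path):
    """Translate a robots.txt path ('*' wildcard, trailing '$' anchor) to a regex."""
    anchored = path.endswith("$")
    body = path[:-1] if anchored else path
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(product_token, url_path):
    """Return True if url_path may be fetched by the given product token."""
    # Fall back to the '*' group when no group names this crawler.
    rules = GROUPS.get(product_token.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); the larger tuple is more specific
    for rule, path in rules:
        if pattern_to_regex(path).match(url_path):
            candidate = (len(path), rule == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]
```

For example, `is_allowed("somebot", "/_ui/edm/x")` is false because `/_ui/edm/` is a longer match than `/`, while `is_allowed("MJ12bot", "/p/1")` is false because that crawler is blocked site-wide. A parser that applies the first matching rule instead of the longest one would treat the leading `Allow: /` in the `*` group as overriding every later Disallow, so the choice of matching strategy matters for this file.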

Other Records

Field	Value
sitemap	https://www.trplus.com.tw/_ui/marketing/sitemap/sitemap.xml

Comments

Request-rate: 1/10 # maximum rate is one page every 10 seconds
Crawl-delay: 10 # 10 seconds between page requests
Visit-time: 0200-0845 # only visit between 04:00 and 08:45 UTC
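
These three lines appear in the file only as comments, and `Request-rate`, `Crawl-delay`, and `Visit-time` are non-standard extensions that RFC 9309 does not define, so crawlers are not required to honor them. As an illustration of how the values read, here is a small sketch that parses them; the helper names are hypothetical. It parses the `Visit-time` value as written (0200), which differs from the 04:00 stated in its trailing comment.

```python
from datetime import time


def strip_comment(line):
    """Split 'Field: value # comment' into a (field, value) pair."""
    directive = line.split("#", 1)[0]
    field, value = directive.split(":", 1)
    return field.strip().lower(), value.strip()


def parse_request_rate(value):
    """Convert 'pages/seconds' (e.g. '1/10') to seconds between requests."""
    pages, seconds = value.split("/")
    return int(seconds) / int(pages)


def parse_visit_time(value):
    """Convert 'HHMM-HHMM' to a (start, end) pair of datetime.time objects."""
    start, end = value.split("-")
    return (
        time(int(start[:2]), int(start[2:])),
        time(int(end[:2]), int(end[2:])),
    )
```

With the values above, `parse_request_rate("1/10")` gives 10 seconds between requests, which agrees with the `Crawl-delay` of 10.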
