cl.trud.com
robots.txt

Robots Exclusion Standard data for cl.trud.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cl.trud.com
Base Domain	trud.com
Scan Status	Ok
Last Scan	2024-09-16T05:16:17+00:00
Next Scan	2024-10-16T05:16:17+00:00

Last Scan

Scanned	2024-09-16T05:16:17+00:00
URL	https://cl.trud.com/robots.txt
Domain IPs	104.21.53.21, 172.67.207.200, 2606:4700:3036::ac43:cfc8, 2606:4700:3037::6815:3515
Response IP	172.67.207.200
Found	Yes
Hash	0df23c001a4bfa5efaac68cd633ea009253d800cec15f2b72dd0e75866ad5e67
SimHash	42062e424b33

Groups

*

Rule	Path
Allow	/company.html?page=
Allow	/css/min/main.min.css?
Disallow	*?
Disallow	*/search/
Disallow	/crm/
Disallow	/crm2/
Disallow	/office/
Disallow	/site/redirect/url

Rule

Path

Allow

/company.html?page=

Allow

/css/min/main.min.css?

Disallow

*/search/

Disallow

/crm/

Disallow

/crm2/

Disallow

/office/

Disallow

/site/redirect/url

moget
ichiro

Rule	Path
Disallow	/

Rule

Path

Disallow

naverbot
yeti

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider
baiduspider-video
baiduspider-image

Rule	Path
Disallow	/

Rule

Path

Disallow

youdaobot

Rule	Path
Disallow	/

Rule

Path

Disallow

seokicks-robot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

solomonobot

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

blekkobot

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix

Rule	Path
Disallow	/

Rule

Path

Disallow

proximic

Rule	Path
Disallow	/

Rule

Path

Disallow

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

psbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gigabot

Rule	Path
Disallow	/

Rule

Path

Disallow

irlbot

Rule	Path
Disallow	/

Rule

Path

Disallow

twiceler

Rule	Path
Disallow	/

Rule

Path

Disallow

cazoodlebot

Rule	Path
Disallow	/

Rule

Path

Disallow

webinject

Rule	Path
Disallow	/

Rule

Path

Disallow

spbot

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshot

Rule	Path
Disallow	/

Rule

Path

Disallow

shopwiki

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://cl.trud.com/sitemap/cl.trud.com-sitemap.xml.gz

Field

Value

sitemap

https://cl.trud.com/sitemap/cl.trud.com-sitemap.xml.gz

Warnings

`host` is not a known field.

cl.trud.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

mogetichiro

naverbotyeti

baiduspiderbaiduspider-videobaiduspider-image

youdaobot

seokicks-robot

mj12bot

solomonobot

rogerbot

semrushbot

blekkobot

sistrix

proximic

turnitinbot

psbot

gigabot

irlbot

twiceler

cazoodlebot

webinject

spbot

grapeshot

shopwiki

Other Records

Warnings

cl.trud.com
robots.txt

moget
ichiro

naverbot
yeti

baiduspider
baiduspider-video
baiduspider-image