dailyexpress.co.uk
robots.txt

Robots Exclusion Standard data for dailyexpress.co.uk

Resource Scan

Scan Details

Site Domain	dailyexpress.co.uk
Base Domain	dailyexpress.co.uk
Scan Status	Ok
Last Scan	2024-05-12T19:42:36+00:00
Next Scan	2024-05-19T19:42:36+00:00

Last Scan

Scanned	2024-05-12T19:42:36+00:00
URL	https://dailyexpress.co.uk/robots.txt
Domain IPs	13.33.88.125, 13.33.88.2, 13.33.88.25, 13.33.88.51
Response IP	13.33.88.25
Found	Yes
Hash	aabd05da898889d59f1edbadbad14b2b2ff1b11bdcfdfe7b4083988698911082
SimHash	a8466a464721

Groups

*

Rule	Path	Comment
Disallow	/myexpress/	-
Disallow	/printer/	We'll keep the print version for our newspaper
Disallow	/users/	-
Disallow	/sponsored/	Advertorials
Disallow	/trackings/	Adserving
Disallow	/34722903/	Adserving
Disallow	/search?*	-
Disallow	/videos/get_video_by_uid/	-
Disallow	/videos/viewmeta/	-

grapeshot

Rule	Path
Disallow

googlebot-news

Rule	Path	Comment
Disallow	/myexpress/	-
Disallow	/printer/	We'll keep the print version for our newspaper
Disallow	/users/	-
Disallow	/fun/	-
Disallow	/sponsored/	Advertorials
Disallow	/trackings/	Adserving
Disallow	/34722903/	Adserving
Disallow	/sponsoredfeatures	-
Disallow	/search?*	-
Disallow	/videos/get_video_by_uid/	-
Disallow	/videos/viewmeta/	-

ia_archiver

Rule	Path
Disallow	/

nutch

Rule	Path
Disallow	/
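
The groups listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a hand-written partial reconstruction from the tables in this report, not the file as served, and `ExampleBot` is a hypothetical crawler name. The `/search?*` rule is left out on purpose because `urllib.robotparser` does simple prefix matching and does not interpret `*` wildcards inside paths.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction of the scanned file, built from the
# group tables in this report. The real file may differ in order and layout.
ROBOTS_TXT = """\
User-agent: *
Disallow: /myexpress/
Disallow: /printer/
Disallow: /users/
Disallow: /sponsored/

User-agent: grapeshot
Disallow:

User-agent: googlebot-news
Disallow: /printer/
Disallow: /fun/

User-agent: ia_archiver
Disallow: /

User-agent: nutch
Disallow: /
"""

BASE = "https://dailyexpress.co.uk"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is hypothetical; it matches no named group, so the * group applies.
print(parser.can_fetch("ExampleBot", BASE + "/printer/page"))      # blocked by *
print(parser.can_fetch("ExampleBot", BASE + "/fun/quiz"))          # /fun/ is not in *
print(parser.can_fetch("googlebot-news", BASE + "/fun/quiz"))      # blocked for this agent
print(parser.can_fetch("grapeshot", BASE + "/printer/page"))       # empty Disallow allows all
print(parser.can_fetch("ia_archiver", BASE + "/"))                 # whole site blocked
```

An empty `Disallow` value, as in the grapeshot group, places no restriction on that agent, while `Disallow: /` blocks the whole site for ia_archiver and nutch.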

Other Records

Field	Value
sitemap	https://www.express.co.uk/sitemap.xml
sitemap	https://www.express.co.uk/googlenews.xml
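
The sitemap records above can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available); the `SITEMAP_LINES` string is reconstructed from the two records in this report and is not the file as served.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the two sitemap records, rewritten as robots.txt lines.
SITEMAP_LINES = """\
Sitemap: https://www.express.co.uk/sitemap.xml
Sitemap: https://www.express.co.uk/googlenews.xml
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES.splitlines())

# site_maps() returns the listed URLs in file order, or None if there are none.
sitemaps = sitemap_parser.site_maps()
for url in sitemaps or []:
    print(url)
```

Both sitemap URLs point to www.express.co.uk, not to the scanned dailyexpress.co.uk host.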

Comments

170820-DXD-6728
