manutd.com
robots.txt

Robots Exclusion Standard data for manutd.com

Resource Scan

Scan Details

Site Domain manutd.com
Base Domain manutd.com
Scan Status Ok
Last Scan2024-11-16T09:21:55+00:00
Next Scan 2024-11-30T09:21:55+00:00

Last Scan

Scanned2024-11-16T09:21:55+00:00
URL https://www.manutd.com/robots.txt
Domain IPs 104.83.197.27
Response IP 184.25.221.26
Found Yes
Hash b275c79f7e90abfb8df4236c8574696eacc342b845f0b6a1e73331fde3aff7ea
SimHash e91f0e415ab3

Groups

*

Rule Path
Disallow /sitecore/
Disallow /mufc/
Allow /AssetPicker/Images/
Allow /AssetPicker/images/
Disallow /AssetPicker/
Disallow /assets/
Disallow /ruxitagentjs_2SVfqr_10139180201161915.js
Disallow /build/
Disallow /en/myunited/
Disallow /fr/myunited/
Disallow /es/myunited/
Disallow /en/myunited/
Disallow /ar/myunited/
Disallow /zh/myunited/
Disallow /ja/myunited/
Disallow /ko/myunited/
Disallow */social*
Disallow */collection*
Disallow /*search?q=*

twitterbot

Rule Path
Allow /AssetPicker/Images
Disallow /*search?q=*

facebookexternalhit

Rule Path
Allow /AssetPicker/Images
Disallow /*search?q=*

googlebot

Rule Path
Allow /build/
Disallow /*search?q=*

googlebot-news

Rule Path
Allow /build/
Disallow /*search?q=*

googlebot-video

Rule Path
Allow /build/
Disallow /*search?q=*

googlebot-mobile

Rule Path
Allow /build/
Disallow /*search?q=*

googlebot-image

Rule Path
Allow /build/
Disallow /*search?q=*

Other Records

Field Value
sitemap https://www.manutd.com/sitemap.xml