plough.com
robots.txt

Robots Exclusion Standard data for plough.com

Resource Scan

Scan Details

Site Domain plough.com
Base Domain plough.com
Scan Status Ok
Last Scan 2024-09-18T00:43:46+00:00
Next Scan 2024-10-18T00:43:46+00:00

Last Scan

Scanned 2024-09-18T00:43:46+00:00
URL https://plough.com/robots.txt
Redirect https://www.plough.com/robots.txt
Redirect Domain www.plough.com
Redirect Base plough.com
Domain IPs 52.176.6.0
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash 64f57dd30b14a6039e3a85ae166be726efe628989a7c887fb7fbf00e91c38f9e
SimHash 9d483d2d6812
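
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch, assuming SHA-256 over the raw response body; the helper name `sha256_hex` is hypothetical and not part of the scanner:

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Example with a made-up body, not the real plough.com file.
sample_body = b"User-agent: *\nDisallow: /temp/\n"
print(len(sha256_hex(sample_body)))  # digest length in hex characters
```

Comparing the digest of a freshly fetched copy with the stored Hash would show whether the file changed since the last scan, provided the assumption about the algorithm holds.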

Groups

*

Rule Path
Disallow /_Classes/
Disallow /_Controls/
Disallow /_data/
Disallow /_EnvironmentSettings/
Disallow /_Flash/
Disallow /_Utilities/
Disallow /IndexSearhHelpers/
Disallow /sitecore/
Disallow /temp/
Disallow /upload/
Disallow /xsl/
Disallow /test/
Disallow /-/media/Files/Plough/ebooks/
Disallow /-/media/files/plough/ebooks/
Disallow /-/media/Files/Plough/audiobooks/
Disallow /-/media/files/plough/audiobooks/
Disallow /-/media/Files/Plough/ebooks/
Disallow /-/media/files/plough/ebooks/
Disallow /-/media/Files/Plough/audiobooks/
Disallow /-/media/files/plough/audiobooks/
Disallow /-/media/files/plough/magazines/Quarterly
Disallow /-/media/files/plough/magazines/quarterly
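
The rules in this group can be checked programmatically. A minimal sketch, assuming Python's standard `urllib.robotparser` and only a subset of the Disallow lines above; the helper `allowed` and the sample paths are made up for illustration:

```python
from urllib.robotparser import RobotFileParser

# Subset of the rules listed above, rewritten as robots.txt lines.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /sitecore/",
    "Disallow: /temp/",
    "Disallow: /-/media/Files/Plough/ebooks/",
    "Disallow: /-/media/files/plough/ebooks/",
    "Disallow: /-/media/files/plough/magazines/Quarterly",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(path: str) -> bool:
    """Return True if a generic crawler ("*") may fetch the given path."""
    return parser.can_fetch("*", "https://www.plough.com" + path)


# Hypothetical sample paths, chosen only to exercise the rules.
for sample in (
    "/sitecore/login",
    "/-/media/files/plough/ebooks/example.epub",
    "/en/topics",
):
    print(sample, "allowed" if allowed(sample) else "disallowed")
```

This parser matches rule paths as case-sensitive prefixes, which may be why the file lists both capitalized and lowercase variants of the media paths.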

Other Records

Field Value
sitemap https://www.plough.com/sitemapPlough.xml

Warnings

  • 1 invalid line.