agricoleideal.com
robots.txt

Robots Exclusion Standard data for agricoleideal.com

Resource Scan

Scan Details

Site Domain agricoleideal.com
Base Domain agricoleideal.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-05T15:10:15+00:00
Next Scan 2025-12-04T15:10:15+00:00

Last Successful Scan

Scanned2025-05-09T14:41:55+00:00
URL https://agricoleideal.com/robots.txt
Redirect https://www.agricoleideal.com/robots.txt
Redirect Domain www.agricoleideal.com
Redirect Base agricoleideal.com
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Redirect IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.64.1
Found Yes
Hash 7f9cb8124601cb8e9638afaaa8c60afbf8e2c83ad64f337589a6ea8de8bdd682
SimHash ef1c44655c91

Groups

petalbot
semrushbot

Rule Path
Disallow /

googlebot
adsbot-google
mediapartners-google
googlebot-image
*

Rule Path
Disallow /api/
Disallow /filter-count/
Disallow /admin/
Disallow /compare/
Disallow /favourites/
Disallow /listings/*/near/*
Disallow /includes/
Disallow /list/add-watch-list.cfm
Disallow /list/view_image.cfm
Disallow /list/view_image-print.cfm
Disallow /list/view_image-new.cfm
Disallow /list/emailpop-up.cfm
Disallow /pages/index.cfm
Disallow /pages/?PortalID=
Disallow /pages/?ClientID=
Disallow /pages/special.cfm
Disallow /pages/custom-content.cfm
Disallow /pages/view-listing.cfm
Disallow /display.cfm
Disallow /t/tracking.cfm

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.agdealer.com/storage/sitemaps/en/sitemap.xml
sitemap https://www.agdealer.com/storage/sitemaps/fr/sitemap.xml

Comments

  • completely block these user-agents
  • have to specifically block AdsBot-Google etc as they ignore the User-agent*