agtechnavigator.com
robots.txt

Robots Exclusion Standard data for agtechnavigator.com

Resource Scan

Scan Details

Site Domain agtechnavigator.com
Base Domain agtechnavigator.com
Scan Status Ok
Last Scan 2025-05-18T23:14:35+00:00
Next Scan 2025-05-19T23:14:35+00:00

Last Scan

Scanned 2025-05-18T23:14:35+00:00
URL https://agtechnavigator.com/robots.txt
Redirect https://www.agtechnavigator.com:443/robots.txt
Redirect Domain www.agtechnavigator.com
Redirect Base agtechnavigator.com
Domain IPs 15.197.68.198, 15.197.82.42
Redirect IPs 23.46.230.137, 23.46.230.157, 2600:1413:5000:3::1736:7692, 2600:1413:5000:3::1736:76a3
Response IP 125.56.219.33
Found Yes
Hash 45672f0aee05526e707ad1e4daf393ffd9c4c00215fc2310e8bd0cbc6c574b4a
SimHash f41ef110c49d

Groups

*

Rule Path
Disallow /search/
Disallow /newsletter/
Disallow */pf/api/
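
The */pf/api/ rule depends on wildcard matching, which not every parser implements. The following is a minimal sketch of how such a rule could be evaluated, assuming RFC 9309 style semantics (* matches any run of characters, a trailing $ anchors the end of the path). The helper names rule_matches and is_blocked and the sample paths are hypothetical and are not taken from the scanned site.

```python
import re

# Disallow paths listed for the * group in this report.
DISALLOW = ["/search/", "/newsletter/", "*/pf/api/"]


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path.

    Assumes RFC 9309 style wildcards: '*' matches any sequence of
    characters and a trailing '$' anchors the match at the end.
    """
    if not pattern:
        # An empty Disallow value matches nothing (everything is allowed).
        return False
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        return re.fullmatch(regex, path) is not None
    # Without '$', the pattern only has to match a prefix of the path.
    return re.match(regex, path) is not None


def is_blocked(path: str) -> bool:
    """Return True if any Disallow rule of the * group matches the path."""
    return any(rule_matches(rule, path) for rule in DISALLOW)
```

Under these assumptions, a path such as /section/pf/api/v3/content would be blocked by the wildcard rule, while an ordinary article path would not match any of the three rules.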

mozilla/5.0 (x11; linux x86_64; rv:87.0) gecko/20100101 firefox/87.0;onetrustbot;

Rule Path
Disallow

anthropic-ai
bytespider
chatgpt
claudebot
gptbot
oai-searchbot

Rule Path
Disallow /search/
Disallow /newsletter/
Disallow */pf/api/

Other Records

Field Value
crawl-delay 1
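
A crawler could read the crawl-delay for this group with Python's standard urllib.robotparser module. The sketch below is hedged in two ways: the robots.txt fragment is reconstructed from the parsed groups in this report (the raw file is not shown here), and the */pf/api/ rule is left out because the standard-library parser does prefix matching only and does not expand the * wildcard.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (an assumption: the report lists parsed groups,
# not the raw file). Only two of the listed user agents are included.
FRAGMENT = """\
User-agent: gptbot
User-agent: claudebot
Disallow: /search/
Disallow: /newsletter/
Crawl-delay: 1
"""

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

# Delay in seconds between requests for this agent, or None if unset.
delay = parser.crawl_delay("GPTBot")

# User-agent matching is case-insensitive; paths are matched by prefix.
search_ok = parser.can_fetch("GPTBot", "https://www.agtechnavigator.com/search/")
home_ok = parser.can_fetch("GPTBot", "https://www.agtechnavigator.com/")
```

Note that crawl-delay is a non-standard field, so whether a given crawler honors it is up to that crawler.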

ahrefsbot
ahrefssiteaudit
amazonbot
archive-it
audacy-podcast-scraper
discordbot
facebookexternalhit
feedburner
feedly
feedparser
feedvalidator
flipboard
gofeed
hubspot crawler
netnewswire
pinterestbot
twitterbot
whatsapp

Rule Path
Disallow /search/
Disallow /newsletter/
Disallow */pf/api/

Other Records

Field Value
crawl-delay 1

adsbot
aliyunsecbot
archivebot
awariobot
backlinksextendedbot
baiduspider
barkrowler
birdcrawlerbot
bitsightbot
blackboardally
blexbot
blogtrottr
bluechipbacklinks
bpimagewalker
bravebot
ccbot
cincraw
cis5550-crawler
cloudservermarketspider
coccocbot
coibotparser
crsspxlbot
cyberfindcrawler
cyotekwebcopy
darwin
dataforseobot
deepcrawl
diffbot
domainsbot
domainstatsbot
domcopbot
dotbot
dragonbot
dubbotbot
emailwolf
equellaurlbot
ev-crawler
exabot
facebot
faraday
feedbot
filemaker
glmslinkanalysis
grapeshotcrawler
harsilbot
iaskspider
imagesiftbot
inetdex-bot
informa java api
intently
io_bot
isscyberriskcrawler
istellabot
kazbtbot
linguee bot
linkanalyser
linkdexbot
linkfluence
livelapbot
magus bot
maxthon
mixrankbot
mj12bot
mj12bot
mojeekbot
monsidobot
moodlebot
netcraftsurveyagent
netestate ne crawler
neticle crawler
netpeak
newsify
nicecrawler
nodeping
online-webceo-bot
openindexspider
orbbot
paqlebot
petalbot
pocketparser
pubmatic crawler bot
rankurbot
scrapy
searchbot
sebot
seekport crawler
seekportbot
seekrbot
semanticscholarbot
sentibot
seo-audit-check-bot
seobilitybot
serpstatbot
seznambot
seznambot
simplepie
siteauditbot
sitebulb
siteone-crawler
smtbot
sprinklr
srcedamp
staffbase
startmebot
storebot
superbot
superpagesbot2
surdotlybot
tentacles
timpibot
trendictionbot0
turnitin
uk_lddc_bot
ultimate_sitemap_parser
vebidoobot
velenpublicwebcrawler
wallabyupbot
webwikibot
wellknownbot
wepchsearchengine
xbot
yeti
yisouspider
zoominfobot

Rule Path
Disallow /
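
Which group applies to a crawler depends on its user-agent token. A minimal sketch of that selection, assuming case-insensitive matching on the product token with a fallback to the * group when no named group matches. The GROUPS mapping is a hypothetical subset of the groups in this report, and select_group and fully_blocked are illustrative names.

```python
from typing import Dict, List

# Hypothetical subset of the report's groups: user-agent token -> Disallow paths.
GROUPS: Dict[str, List[str]] = {
    "*": ["/search/", "/newsletter/", "*/pf/api/"],
    "gptbot": ["/search/", "/newsletter/", "*/pf/api/"],
    "ccbot": ["/"],
    "mj12bot": ["/"],
}


def select_group(product_token: str) -> str:
    """Return the name of the group that applies to a crawler's product token."""
    token = product_token.lower()
    # Fall back to the catch-all group when the token has no group of its own.
    return token if token in GROUPS else "*"


def fully_blocked(product_token: str) -> bool:
    """Return True if the applicable group disallows the whole site."""
    return "/" in GROUPS[select_group(product_token)]
```

With this subset, CCBot and MJ12bot resolve to groups that disallow the entire site, while a crawler with no named group falls back to the three path rules of the * group.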

Other Records

Field Value
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/news-sitemap/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-section/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/videos/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/promotional-features/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/product-innovations/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/suppliers/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/events/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Business/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Business/Funding-mergers-acquisitions/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Business/Start-ups/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Business/Policy-regulation/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Business/Innovation-trends/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Business/Sustainability/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Business/Accreditations/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Sectors/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Sectors/Precision-agriculture/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Sectors/Regenerative-agriculture/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Sectors/Indoor-vertical-farming/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Sectors/Animal-health/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Sectors/Agri-food/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Sectors/Blue-food/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/Robotics-automation-equipment/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/Crop-inputs/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/Genomics/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/Digitisation/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/AI/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/Supply-chain/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Tech/Biologicals/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Environment/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Environment/Soil-health/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Environment/Biodiversity/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Environment/Water/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Environment/Methane-reduction/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Environment/Carbon/
sitemap https://www.agtechnavigator.com/arc/outboundfeeds/sitemap-index/category/Environment/Waste-reduction-valorisation/

Comments

  • Block all robots going to the following directories
  • OneTrustBot can go anywhere
  • AI bots specific crawl-delay
  • Other bots with specific crawl-delay
  • Unwanted bots
  • Sitemaps

Warnings

  • 3 invalid lines.