infoworld.com
robots.txt

Robots Exclusion Standard data for infoworld.com

Resource Scan

Scan Details

Site Domain infoworld.com
Base Domain infoworld.com
Scan Status Ok
Last Scan2025-08-04T05:50:21+00:00
Next Scan 2025-08-11T05:50:21+00:00

Last Scan

Scanned2025-08-04T05:50:21+00:00
URL https://infoworld.com/robots.txt
Redirect https://www.infoworld.com/robots.txt
Redirect Domain www.infoworld.com
Redirect Base infoworld.com
Domain IPs 192.0.66.100
Redirect IPs 192.0.66.100
Response IP 192.0.66.100
Found Yes
Hash 0dc37932e9d549cd9a8559e32c93069b343959169904ca342ad14ddf30f91fa1
SimHash bb4e0810c1b7

Groups

*

Rule Path
Disallow
Disallow /search*
Disallow */filter/*
Disallow /*?utm_*
Disallow /*//*

ai2bot-dolma

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

https://hada.news

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

isscyberriskcrawler

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

seekr

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

linkedinbot

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.infoworld.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK