cioreviewapac.com
robots.txt

Robots Exclusion Standard data for cioreviewapac.com

Resource Scan

Scan Details

Site Domain cioreviewapac.com
Base Domain cioreviewapac.com
Scan Status Ok
Last Scan 2024-11-02T10:09:09+00:00
Next Scan 2024-11-09T10:09:09+00:00

Last Scan

Scanned 2024-11-02T10:09:09+00:00
URL https://cioreviewapac.com/robots.txt
Redirect https://www.cioreviewapac.com/robots.txt
Redirect Domain www.cioreviewapac.com
Redirect Base cioreviewapac.com
Domain IPs 18.136.111.115
Redirect IPs 18.136.111.115
Response IP 18.136.111.115
Found Yes
Hash c2967758c092a025c576f287a3765b83728b2b6cd717e1830f172aabe34d4ec8
SimHash 3921de90c793

Groups

*

Rule Path Comment
Disallow / Disallow all bots by default
Disallow /scripts/ -
Disallow /*.php$ -
Disallow /zlib/ -
Disallow /lib/ -
Disallow /utils/ -
Disallow /magazine/ -
Disallow /admin/ -

googlebot
googlebot-image
mediapartners-google
adsbot-google
googlebot-mobile
msnbot
slurp
msicrawler
baiduspider
bingbot
seositecheckupbot

Rule Path
Disallow /magazine/
Disallow /admin/

screaming frog seo spider
linkedinbot
twitterbot
ahrefsbot

Rule Path
Allow /

ahrefssiteaudit

Rule Path
Allow /

yandexbot

No rules defined. All paths allowed.
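
The groups above can be read as a lookup: a crawler uses the group that names it, falls back to the * group otherwise, and within the chosen group the longest matching path pattern decides. The following is a minimal sketch of that reading, not the scanner's or any crawler's actual implementation. The GROUPS table is transcribed from this report (the raw robots.txt is not shown here), the names GROUPS, rules_for and is_allowed are hypothetical, and matching an agent by exact lowercase name is a simplifying assumption.

```python
import re

# Rules transcribed from the Groups listing in this report (assumption: the
# report reflects the file faithfully; the raw robots.txt is not shown).
GROUPS = {
    ("*",): [
        ("disallow", "/"),
        ("disallow", "/scripts/"),
        ("disallow", "/*.php$"),
        ("disallow", "/zlib/"),
        ("disallow", "/lib/"),
        ("disallow", "/utils/"),
        ("disallow", "/magazine/"),
        ("disallow", "/admin/"),
    ],
    ("googlebot", "googlebot-image", "mediapartners-google", "adsbot-google",
     "googlebot-mobile", "msnbot", "slurp", "msicrawler", "baiduspider",
     "bingbot", "seositecheckupbot"): [
        ("disallow", "/magazine/"),
        ("disallow", "/admin/"),
    ],
    ("screaming frog seo spider", "linkedinbot", "twitterbot", "ahrefsbot"): [
        ("allow", "/"),
    ],
    ("ahrefssiteaudit",): [("allow", "/")],
    ("yandexbot",): [],  # no rules defined, so every path is allowed
}


def _pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, trailing '$') to a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def rules_for(agent):
    """Return the rules of the group naming the agent, else the '*' group."""
    agent = agent.lower()
    for agents, rules in GROUPS.items():
        if agent in agents:
            return rules
    return GROUPS[("*",)]


def is_allowed(agent, path):
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules_for(agent):
        if _pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


print(is_allowed("googlebot", "/news/"))           # True
print(is_allowed("googlebot", "/magazine/issue"))  # False
print(is_allowed("examplebot", "/news/"))          # False (falls back to *)
print(is_allowed("yandexbot", "/admin/"))          # True
```

Because the * group opens with Disallow /, any crawler not named in a later group is blocked from every path, and the remaining rules in that group add nothing further for it.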

Other Records

Field Value
sitemap https://www.cioreviewapac.com/cioreviewapac_news_sitemap.xml
sitemap https://www.cioreviewapac.com/all_in_one_sitemap_main.xml
sitemap https://www.cioreviewapac.com/cioreviewapac_newnews_sitemap.xml
sitemap https://www.cioreviewapac.com/contributors_sitemap.xml
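
Sitemap records sit outside the user-agent groups, so they apply regardless of which group a crawler matches. As a rough sketch of how such records might be pulled out of a robots.txt body, assuming the file uses the usual "Sitemap: <url>" line form: the SAMPLE text below is a reconstruction from the values listed in this report rather than the actual file, and sitemap_urls is a hypothetical helper name.

```python
# Reconstructed excerpt (assumption): the real file's exact layout is not shown.
SAMPLE = """\
Sitemap: https://www.cioreviewapac.com/cioreviewapac_news_sitemap.xml
Sitemap: https://www.cioreviewapac.com/all_in_one_sitemap_main.xml
Sitemap: https://www.cioreviewapac.com/cioreviewapac_newnews_sitemap.xml
Sitemap: https://www.cioreviewapac.com/contributors_sitemap.xml
"""


def sitemap_urls(robots_txt):
    """Collect the values of all Sitemap fields, ignoring case and comments."""
    urls = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()       # drop trailing comments
        field, sep, value = line.partition(":")    # split at the first colon only
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


for url in sitemap_urls(SAMPLE):
    print(url)
```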

Comments

  • Allow major search engines
  • Allow specific tools
  • Allow AhrefsBot
  • Allow AhrefsSiteAudit