mic.com
robots.txt

Robots Exclusion Standard data for mic.com

Resource Scan

Scan Details

Site Domain mic.com
Base Domain mic.com
Scan Status Ok
Last Scan2024-11-02T06:03:39+00:00
Next Scan 2024-11-09T06:03:39+00:00

Last Scan

Scanned2024-11-02T06:03:39+00:00
URL https://mic.com/robots.txt
Redirect https://www.mic.com/robots.txt
Redirect Domain www.mic.com
Redirect Base mic.com
Domain IPs 13.35.238.114, 13.35.238.57, 13.35.238.90, 13.35.238.97
Redirect IPs 13.35.238.114, 13.35.238.57, 13.35.238.90, 13.35.238.97
Response IP 13.35.238.114
Found Yes
Hash aefb3c8e9cd5700a89dbbf916ca09f6bd6171205b126a2532bc6516775877704
SimHash 2b6dc1e1cb03

Groups

*

Rule Path
Disallow
Disallow /search?*

amazonbot
applebot
applebot-extended
gptbot
newswhipbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.mic.com/sitemaps/pages.xml
sitemap https://www.mic.com/sitemaps/news.xml