inc42.com
robots.txt

Robots Exclusion Standard data for inc42.com

Resource Scan

Scan Details

Site Domain inc42.com
Base Domain inc42.com
Scan Status Ok
Last Scan2024-11-09T05:53:10+00:00
Next Scan 2024-11-16T05:53:10+00:00

Last Scan

Scanned2024-11-09T05:53:10+00:00
URL https://inc42.com/robots.txt
Domain IPs 104.26.12.104, 104.26.13.104, 172.67.75.188, 2606:4700:20::681a:c68, 2606:4700:20::681a:d68, 2606:4700:20::ac43:4bbc
Response IP 104.26.12.104
Found Yes
Hash fad15f08701d69f645a83fa89996e84e944490c8a1d56470ecb8fccf4a512b13
SimHash 490598608b33

Groups

*

Rule Path
Disallow /readme.html
Disallow /wp-admin/
Allow /?display=wide
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Disallow /*?*
Disallow /*/0/*
Allow /*?ver*

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://inc42.com/sitemap_index.xml
sitemap https://inc42.com/news-sitemap.xml
sitemap https://inc42.com/web-story-sitemap.xml