insidermonkey.com
robots.txt

Robots Exclusion Standard data for insidermonkey.com

Resource Scan

Scan Details

Site Domain insidermonkey.com
Base Domain insidermonkey.com
Scan Status Ok
Last Scan2024-11-13T18:22:18+00:00
Next Scan 2024-11-20T18:22:18+00:00

Last Scan

Scanned2024-11-13T18:22:18+00:00
URL https://insidermonkey.com/robots.txt
Redirect https://www.insidermonkey.com:443/robots.txt
Redirect Domain www.insidermonkey.com
Redirect Base insidermonkey.com
Domain IPs 13.248.131.72, 76.223.4.169
Redirect IPs 13.248.131.72, 76.223.4.169
Response IP 76.223.4.169
Found Yes
Hash 451ae7de6e4a9540d00a48e5279c0c5fccc2e4d054c863d86ec49b96effbc211
SimHash 6c4c9871ef37

Groups

*

Rule Path
Disallow /*.txt$
Disallow /*.sql$
Disallow /*.sample$
Disallow /*.md$
Disallow /blog/*preview%3Dtrue*

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

googlebot

Rule Path
Allow /ads.txt

Other Records

Field Value
sitemap https://www.insidermonkey.com/xml-sitemaps/index.xml
sitemap https://www.insidermonkey.com/blog/sitemap.xml
sitemap https://www.insidermonkey.com/blog/google-news-sitemap.xml