magenet.com
robots.txt

Robots Exclusion Standard data for magenet.com

Resource Scan

Scan Details

Site Domain magenet.com
Base Domain magenet.com
Scan Status Ok
Last Scan2024-09-14T23:07:23+00:00
Next Scan 2024-10-14T23:07:23+00:00

Last Scan

Scanned2024-09-14T23:07:23+00:00
URL https://magenet.com/robots.txt
Domain IPs 104.21.44.227, 172.67.204.190, 2606:4700:3033::6815:2ce3, 2606:4700:3035::ac43:ccbe
Response IP 104.21.44.227
Found Yes
Hash 032cb74f2286197a428b617d8cd238fc841d6787568b3db0e17a30d07dd2b5fb
SimHash 6151c5496429

Groups

etaospider

Rule Path
Disallow /

nutch

Rule Path
Disallow /

larbin

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow /view/privacy_policy
Disallow /view/TOS
Disallow /wp-content/plugins/
Disallow /tag/
Disallow /2012/
Disallow /2013/
Disallow /2014/
Disallow /feed/
Disallow /trackback/
Disallow /thank_you_confirm
Disallow /thank_you_confirmed
Disallow /terms-of-use
Disallow /terms-of-use/
Disallow /privacy-policy
Disallow /about-us-slides/
Disallow /reviews/
Disallow */faq-category/*
Disallow */faqs/category/*
Disallow /terms-of-use-page
Disallow /1-2
Disallow /?-u=%7B%3F%24auth_key%3F%7D
Disallow /?calculator_category_list=1
Disallow /blog/page/
Disallow /contact-us/
Allow /wp-content/uploads/

Other Records

Field Value
sitemap https://www.magenet.com/sitemap_index.xml