Robots Exclusion Standard data for media.net
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | media.net |
Base Domain | media.net |
Scan Status | Ok |
Last Scan | 2024-10-20T04:21:40+00:00 |
Next Scan | 2024-11-19T04:21:40+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-20T04:21:40+00:00 |
URL | https://media.net/robots.txt |
Redirect | https://www.media.net/robots.txt |
Redirect Domain | www.media.net |
Redirect Base | media.net |
Domain IPs | 3.226.3.35 |
Redirect IPs | 3.226.3.35 |
Response IP | 3.226.3.35 |
Found | Yes |
Hash | 61bf2984437358d9dae0a595e8c71c83073e7d59a56edd09d44a2a00061795ea |
SimHash | 2b1544058b81 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /captcha.php |
Disallow | /captcha.js |
Disallow | /*reward*?ha |
Disallow | /*program*?ha |
Disallow | /*join*?ha |
Disallow | /*signup*?ha |
Disallow | /podcast |
Disallow | /tcfv2/ |
Disallow | /downloads/supply/ |
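Several of the paths above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such rules might be evaluated, assuming the matching semantics of RFC 9309: a rule is a prefix match against the path plus query string, `*` matches any run of characters, and a trailing `$` anchors the end. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are hypothetical helpers, and the paths in the usage lines are made-up examples, not URLs observed by the scan.

```python
import re

# Disallow paths for the "*" group, copied from the table above.
DISALLOW = [
    "/captcha.php",
    "/captcha.js",
    "/*reward*?ha",
    "/*program*?ha",
    "/*join*?ha",
    "/*signup*?ha",
    "/podcast",
    "/tcfv2/",
    "/downloads/supply/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    Assumed semantics (RFC 9309): '*' is a wildcard, a trailing '$'
    anchors the end, and every other character (including '?') is literal.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal chunk, then rejoin the chunks with '.*'.
    pattern = ".*".join(re.escape(chunk) for chunk in body.split("*"))
    if anchored:
        pattern += "$"
    return re.compile(pattern, re.DOTALL)


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow rule matches from the start of `path`."""
    # re.match anchors at the beginning, which gives prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in rules)


# Hypothetical paths, for illustration only.
print(is_disallowed("/podcasts/episode-1"))   # prefix of /podcast
print(is_disallowed("/rewards/info?ha=1"))    # wildcard rule with literal ?ha
print(is_disallowed("/rewards/info"))         # no ?ha in the query
```

Because this group contains no Allow rules, the sketch skips the longest-match precedence step that a full implementation would need.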
Other Records
Field | Value |
---|---|
sitemap | https://www.media.net/sitemap.xml |
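The group and record tables can be turned back into an approximate robots.txt file. This is a sketch only: the scan does not preserve the original ordering of records, comments, or whitespace, so the output is an assumed equivalent and would not be expected to reproduce the Hash value listed in the scan details. `GROUPS`, `OTHER`, and `render_robots` are hypothetical names.

```python
# Parsed data, copied from the Groups and Other Records tables.
GROUPS = {
    "*": [
        ("Disallow", "/captcha.php"),
        ("Disallow", "/captcha.js"),
        ("Disallow", "/*reward*?ha"),
        ("Disallow", "/*program*?ha"),
        ("Disallow", "/*join*?ha"),
        ("Disallow", "/*signup*?ha"),
        ("Disallow", "/podcast"),
        ("Disallow", "/tcfv2/"),
        ("Disallow", "/downloads/supply/"),
    ],
}
OTHER = [("sitemap", "https://www.media.net/sitemap.xml")]


def render_robots(groups, other) -> str:
    """Render parsed groups and records as robots.txt text (assumed layout)."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line between groups
    lines.extend(f"{field.capitalize()}: {value}" for field, value in other)
    return "\n".join(lines) + "\n"


print(render_robots(GROUPS, OTHER))
```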