exitmedia.net
robots.txt

Robots Exclusion Standard data for exitmedia.net

Resource Scan

Scan Details

Site Domain exitmedia.net
Base Domain exitmedia.net
Scan Status Ok
Last Scan2026-01-09T01:15:43+00:00
Next Scan 2026-02-08T01:15:43+00:00

Last Scan

Scanned2026-01-09T01:15:43+00:00
URL https://exitmedia.net/robots.txt
Domain IPs 185.34.194.5
Response IP 185.34.194.5
Found Yes
Hash d96c36236e07f2db2fc40c09f477d9c26fc1223c3c6d483461e83a2b052aaee0
SimHash 68304c42c7b4

Groups

*

Rule Path
Disallow /wp-content/revistas/
Disallow /adserver/
Disallow /wp-icludes/
Disallow /trackback/
Disallow /wp-admin/
Disallow /login/
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.php$

Other Records

Field Value
crawl-delay 1

all

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

Other Records

Field Value
sitemap https://exitmedia.net/sitemap_index.xml