comradeweb.com
robots.txt

Robots Exclusion Standard data for comradeweb.com

Resource Scan

Scan Details

Site Domain comradeweb.com
Base Domain comradeweb.com
Scan Status Ok
Last Scan2026-01-04T00:26:57+00:00
Next Scan 2026-02-03T00:26:57+00:00

Last Scan

Scanned2026-01-04T00:26:57+00:00
URL https://comradeweb.com/robots.txt
Domain IPs 104.26.8.71, 104.26.9.71, 172.67.74.198, 2606:4700:20::681a:847, 2606:4700:20::681a:947, 2606:4700:20::ac43:4ac6
Response IP 172.67.74.198
Found Yes
Hash c84a248153496b3f2038a835becefe8bcde4799783ccf4cff6ea9068e9542d36
SimHash e5219840dfeb

Groups

*

Rule Path
Allow /wp-content/*
Allow /wp-admin/admin-ajax.php
Allow /wp-includes/*
Allow /wp-json/*
Disallow /wp-admin/
Disallow /cgi-bin
Disallow /*?
Disallow /wp-
Disallow /search
Disallow /author/
Disallow */rss
Disallow */embed
Disallow /*?utm
Disallow /*?source
Disallow /*?wordfence

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

*

Rule Path
Disallow /wp-content/uploads/wp-import-export-lite/

Other Records

Field Value
sitemap https://comradeweb.com/sitemap_index.xml

Comments

  • WP Import Export Rule