joincivil.com
robots.txt

Robots Exclusion Standard data for joincivil.com

Resource Scan

Scan Details

Site Domain joincivil.com
Base Domain joincivil.com
Scan Status Ok
Last Scan 2025-09-25T12:55:33+00:00
Next Scan 2025-10-25T12:55:33+00:00

Last Scan

Scanned 2025-09-25T12:55:33+00:00
URL https://joincivil.com/robots.txt
Domain IPs 104.21.59.76, 172.67.218.162, 2606:4700:3031::ac43:daa2, 2606:4700:3035::6815:3b4c
Response IP 172.67.218.162
Found Yes
Hash 8d55627f70058b77f47ebe865584e507ea36f8d45853d76f8d186dc6b6d7a86b
SimHash 48445c825792
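
A stored hash like the one above lets a later scan tell whether the file has changed. The report does not name the algorithm; the 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function names are hypothetical and are not taken from the scanner.

```python
import hashlib

# Value copied from the "Hash" row of the scan report.
RECORDED = "8d55627f70058b77f47ebe865584e507ea36f8d45853d76f8d186dc6b6d7a86b"

def body_digest(body: bytes) -> str:
    """Hex digest of the raw robots.txt bytes (assumption: SHA-256)."""
    return hashlib.sha256(body).hexdigest()

def unchanged(body: bytes, recorded: str = RECORDED) -> bool:
    """True if the fetched body hashes to the recorded value."""
    return body_digest(body) == recorded.lower()
```

If the assumption holds, calling unchanged() on the bytes fetched from the URL above would return True only while the file is byte-for-byte identical to the scanned copy. A SimHash, by contrast, is meant to stay close under small edits, so it is not a substitute for this exact comparison.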

Groups

*

Rule Path
Disallow /comments/feed
Disallow /feed/$
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /trackback/
Disallow /wp-admin/
Disallow /*.inc$
Disallow */trackback/
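
The paths in this group use two special characters: "*" stands for any run of characters and a trailing "$" anchors the pattern to the end of the URL path. The sketch below shows one way to evaluate such patterns, following the wildcard reading in RFC 9309. The helper names are hypothetical, and a given crawler may interpret the patterns differently.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces, then join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # \Z anchors at the true end of the string; otherwise it is a prefix match.
    return re.compile(body + (r"\Z" if anchored else ""))

def matches(pattern: str, path: str) -> bool:
    """True if the URL path is covered by the pattern (matched from the start)."""
    return pattern_to_regex(pattern).match(path) is not None

if __name__ == "__main__":
    for pattern, path in [
        ("/feed/$", "/feed/"),
        ("/feed/$", "/feed/atom/"),
        ("/*.inc$", "/lib/db.inc"),
        ("*/trackback/", "/2019/post/trackback/"),
    ]:
        print(pattern, path, matches(pattern, path))
```

Under this reading "*" also matches "/", so a path such as /a/b/feed/ is already covered by /*/feed/$, and */trackback/ covers the more specific trackback rules. The deeper variants in the list would then be redundant but harmless.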

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /
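
Taken together, the groups above mean that a crawler obeys only the group naming it and falls back to "*" when none does. The sketch below illustrates that selection together with the longest-match rule from RFC 9309 (the longest matching pattern wins, and Allow wins a tie). It uses a subset of the "*" rules for brevity, all names are hypothetical, and the user-agent comparison is simplified to a case-insensitive exact match.

```python
import re

# Subset of the scanned groups; keys are lower-case product tokens.
GROUPS = {
    "*": [("disallow", "/wp-admin/"), ("disallow", "/feed/$"),
          ("disallow", "*/trackback/")],
    "googlebot-image": [("allow", "/")],
    "ia_archiver": [("disallow", "/")],
}

def _matches(pattern: str, path: str) -> bool:
    """Wildcard match: '*' is any run of characters, trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in core.split("*"))
    return re.match(regex + (r"\Z" if anchored else ""), path) is not None

def select_group(user_agent: str):
    """Rules of the group naming this crawler, else the '*' group."""
    return GROUPS.get(user_agent.lower(), GROUPS["*"])

def is_allowed(user_agent: str, path: str) -> bool:
    """Apply the longest matching rule; no matching rule means allowed."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for verb, pattern in select_group(user_agent):
        if _matches(pattern, path):
            candidate = (len(pattern), verb == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

With these rules, ia_archiver is refused every path, while a crawler identifying as googlebot-image is allowed everywhere, including /wp-admin/, because the "*" group no longer applies once a more specific group is selected.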

Other Records

Field Value
sitemap https://joincivil.com/sitemap.xml