crewnetwork.org
robots.txt

Robots Exclusion Standard data for crewnetwork.org

Resource Scan

Scan Details

Site Domain crewnetwork.org
Base Domain crewnetwork.org
Scan Status Ok
Last Scan 2024-09-12T23:40:31+00:00
Next Scan 2024-10-12T23:40:31+00:00

Last Scan

Scanned 2024-09-12T23:40:31+00:00
URL https://crewnetwork.org/robots.txt
Domain IPs 76.76.21.21
Response IP 76.76.21.21
Found Yes
Hash 8351a20443d6d61c563e9641ae97c24852eaff9033086a97f20aef7bd659ecc6
SimHash 4f4dfac2f8b2

Groups

*

Rule Path
Disallow /api/*
Disallow /special-pages/*
Disallow /deployment-monitor

mj12bot

Rule Path
Disallow /

buck

Rule Path
Disallow /

*

Rule Path
Disallow /getmedia/*
Disallow /CrewNetwork/media/*
Disallow /wp-login.php
Disallow /wp-content/*
Disallow /wp-admin/*
Disallow /files/*

buck

Rule Path
Disallow /
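
The groups above can be checked against a path with a short script. The following is a minimal Python sketch, not the scanner's own logic. It assumes RFC 9309-style matching: groups repeated for the same user agent are merged, `*` in a path is a wildcard, and a crawler with its own group ignores the `*` group. The function names and the sample paths (`/api/users`, `/about`, `/wp-admin/options.php`) are hypothetical and do not come from the scan.

```python
import re

# Rule groups as listed in the scan. The repeated "*" and "buck" groups are
# merged here (an assumption based on RFC 9309, not something the scan states).
GROUPS = {
    "*": [
        "/api/*", "/special-pages/*", "/deployment-monitor",
        "/getmedia/*", "/CrewNetwork/media/*", "/wp-login.php",
        "/wp-content/*", "/wp-admin/*", "/files/*",
    ],
    "mj12bot": ["/"],
    "buck": ["/"],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(product_token, path):
    """Return True if `path` is disallowed for the given crawler.

    `product_token` is the crawler's short name (e.g. "mj12bot"), not a full
    User-Agent header. A crawler without its own group falls back to "*".
    All listed rules are Disallow rules, so any match means "disallowed".
    """
    rules = GROUPS.get(product_token.lower(), GROUPS["*"])
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("googlebot", "/api/users"),
        ("googlebot", "/about"),
        ("MJ12bot", "/about"),
    ]:
        print(agent, path, is_disallowed(agent, path))
```

Note that Python's standard `urllib.robotparser` does not treat `*` inside a path as a wildcard and keeps only the first `*` group, so it would likely report different results for this file; that is why the sketch uses its own matcher.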

Other Records

Field Value Comment
sitemap crewnetwork.org/sitemap_index.xml Index sitemap
sitemap crewnetwork.org/sitemap.xml -
sitemap crewnetwork.org/en/sitemap.xml -

Comments

  • Sitemaps