natca.org
robots.txt

Robots Exclusion Standard data for natca.org

Resource Scan

Scan Details

Site Domain natca.org
Base Domain natca.org
Scan Status Ok
Last Scan 2024-11-11T15:05:39+00:00
Next Scan 2024-12-11T15:05:39+00:00

Last Scan

Scanned 2024-11-11T15:05:39+00:00
URL https://natca.org/robots.txt
Domain IPs 104.26.4.229, 104.26.5.229, 172.67.68.72, 2606:4700:20::681a:4e5, 2606:4700:20::681a:5e5, 2606:4700:20::ac43:4448
Response IP 172.67.68.72
Found Yes
Hash 571e1db0157890fedd09eb0f94a6a02a997dfe1b69871c3eef6bb1d2f982b70b
SimHash 420d8c00e238

Groups

*

Rule Path
Disallow /wp-admin/

Other Records

Field Value
crawl-delay 3

ahrefsbot
semrushbot
bidubrowser
bytedance
bytespider
liveaerosearchengine
mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.natca.org/sitemap_index.xml
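
The groups and records listed in this report can be sketched as a robots.txt file and checked with Python's standard urllib.robotparser module. The file text below is a reconstruction from the scan data and is an assumption: the exact wording, casing, and ordering of the live file are not shown in this report. The helper name `allowed` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; the live file's exact text is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Crawl-delay: 3

User-agent: ahrefsbot
User-agent: semrushbot
User-agent: bidubrowser
User-agent: bytedance
User-agent: bytespider
User-agent: liveaerosearchengine
User-agent: mj12bot
Disallow: /

Sitemap: https://www.natca.org/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on natca.org?"""
    return parser.can_fetch(agent, "https://natca.org" + path)


if __name__ == "__main__":
    # Sample agents and paths chosen for illustration only.
    for agent, path in [
        ("Googlebot", "/wp-admin/"),
        ("Googlebot", "/about/"),
        ("ahrefsbot", "/about/"),
    ]:
        print(agent, path, allowed(agent, path))
    print("crawl-delay for *:", parser.crawl_delay("Googlebot"))
    print("sitemaps:", parser.site_maps())
```

Under this reconstruction, agents that fall into the `*` group are blocked only from /wp-admin/ and are asked to wait 3 seconds between requests, while the seven named crawlers are blocked from the whole site. Note that `site_maps()` requires Python 3.8 or later.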