arlnow.com
robots.txt

Robots Exclusion Standard data for arlnow.com

Resource Scan

Scan Details

Site Domain arlnow.com
Base Domain arlnow.com
Scan Status Ok
Last Scan2024-09-21T01:58:26+00:00
Next Scan 2024-09-28T01:58:26+00:00

Last Scan

Scanned2024-09-21T01:58:26+00:00
URL https://arlnow.com/robots.txt
Redirect https://www.arlnow.com/robots.txt
Redirect Domain www.arlnow.com
Redirect Base arlnow.com
Domain IPs 104.20.2.31, 104.20.3.31, 172.67.34.184, 2606:4700:10::6814:21f, 2606:4700:10::6814:31f, 2606:4700:10::ac43:22b8
Redirect IPs 104.20.2.31, 104.20.3.31, 172.67.34.184, 2606:4700:10::6814:21f, 2606:4700:10::6814:31f, 2606:4700:10::ac43:22b8
Response IP 104.20.2.31
Found Yes
Hash c6b473be2a39561e3328799da37450c809b44872418a817a817602f1f7497521
SimHash 5a01d86189b2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.arlnow.com/sitemap.xml
sitemap https://www.arlnow.com/sitemap.rss