newhr.org
robots.txt

Robots Exclusion Standard data for newhr.org

Resource Scan

Scan Details

Site Domain newhr.org
Base Domain newhr.org
Scan Status Ok
Last Scan2024-05-20T20:49:14+00:00
Next Scan 2024-06-19T20:49:14+00:00

Last Scan

Scanned2024-05-20T20:49:14+00:00
URL https://newhr.org/robots.txt
Domain IPs 104.21.24.101, 172.67.218.45, 2606:4700:3031::ac43:da2d, 2606:4700:3037::6815:1865
Response IP 172.67.218.45
Found Yes
Hash b596a7fe78f78e748f8f010e2551c8402f4a87992dbd109f468402e77bef0e24
SimHash bb29fa63aaf1

Groups

*

Rule Path
Disallow /page18841165.html
Disallow /header
Disallow /page18991438.html
Disallow /footer
Disallow /tilda/form*
Disallow /tilda/rec*
Disallow /tilda/click*
Disallow /tilda/scroll*
Disallow /tilda/popup*
Disallow /tilda/cart*
Disallow /tilda/product*
Disallow /tilda/event*
Disallow /*_escaped_fragment_*
Disallow /members/login*
Disallow /members/signup*
Disallow

Other Records

Field Value
sitemap https://newhr.org/sitemap.xml
sitemap https://newhr.org/sitemap-feeds.xml

Warnings

  • `host` is not a known field.