newarktrust.org
robots.txt

Robots Exclusion Standard data for newarktrust.org

Resource Scan

Scan Details

Site Domain newarktrust.org
Base Domain newarktrust.org
Scan Status Ok
Last Scan2026-03-25T01:25:05+00:00
Next Scan 2026-04-24T01:25:05+00:00

Last Scan

Scanned2026-03-25T01:25:05+00:00
URL https://newarktrust.org/robots.txt
Redirect https://www.kirikuylabruja.com/robots.txt
Redirect Domain www.kirikuylabruja.com
Redirect Base kirikuylabruja.com
Domain IPs 104.21.21.192, 172.67.200.13, 2606:4700:3035::ac43:c80d, 2606:4700:3037::6815:15c0
Redirect IPs 104.21.93.87, 172.67.208.47, 2606:4700:3030::6815:5d57, 2606:4700:3035::ac43:d02f
Response IP 172.67.208.47
Found Yes
Hash 418670e791ebffe20ec4a77ce7c1509997100a22eaa859f38b997e08c9f26c35
SimHash eb4ff8eaea1b

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-content/cache/

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webdownloader

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offlineexplorer

Rule Path
Disallow /

httrack

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

httpweazel

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.kirikuylabruja.com/sitemap.xml