
Robots Exclusion Standard data for pedomanharian.org

Resource Scan

Scan Details

Site Domain pedomanharian.org
Base Domain pedomanharian.org
Scan Status Ok
Last Scan 2025-11-12T16:30:40+00:00
Next Scan 2025-12-12T16:30:40+00:00

Last Scan

Scanned 2025-11-12T16:30:40+00:00
URL https://pedomanharian.org/robots.txt
Domain IPs 13.35.238.29, 13.35.238.31, 13.35.238.39, 13.35.238.43
Response IP 13.35.238.29
Found Yes
Hash ec86d88fe865bd8f6be4776f57daefd52b455e3d6a1e21dda32b06e90543d406
SimHash c921e88ce51a

Groups

*

Rule Path
Disallow /api/
Disallow /content/
Disallow /xmlrpc.php
Disallow /wp-*
Disallow /?s=*
Disallow /getprint/
Disallow /*-getprint/
Disallow /getprint
Disallow /*-getprint
Disallow /custom-post-types/subscription/
Disallow /page/*
Disallow /page/*/
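
To illustrate how a crawler might evaluate the rules above, here is a minimal Python sketch. It assumes the common wildcard semantics described in RFC 9309: each rule is a prefix match against the URL path, `*` matches any run of characters, and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical and are not part of the scan tool. Because the group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group listed in the scan above.
DISALLOW = [
    "/api/",
    "/content/",
    "/xmlrpc.php",
    "/wp-*",
    "/?s=*",
    "/getprint/",
    "/*-getprint/",
    "/getprint",
    "/*-getprint",
    "/custom-post-types/subscription/",
    "/page/*",
    "/page/*/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (assumed RFC 9309 semantics)."""
    anchored = rule.endswith("$")  # a trailing "$" pins the end of the path
    body = rule[:-1] if anchored else rule
    # Escape literal pieces and let each "*" match any run of characters.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the path from its start."""
    # re.match anchors at the beginning, which gives the prefix behaviour.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    # The sample paths below are made up for illustration.
    for sample in ["/wp-admin/", "/?s=news", "/page/2/", "/news/some-article/"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, several rules overlap: `/getprint` already covers `/getprint/`, `/*-getprint` covers `/*-getprint/`, and `/page/*` covers `/page/*/`, since every rule is a prefix match.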