paperap.com
robots.txt

Robots Exclusion Standard data for paperap.com

Resource Scan

Scan Details

Site Domain paperap.com
Base Domain paperap.com
Scan Status Ok
Last Scan2025-10-20T16:45:56+00:00
Next Scan 2025-10-27T16:45:56+00:00

Last Scan

Scanned2025-10-20T16:45:56+00:00
URL https://paperap.com/robots.txt
Domain IPs 104.21.21.206, 172.67.200.85, 2606:4700:3031::ac43:c855, 2606:4700:3033::6815:15ce
Response IP 104.21.21.206
Found Yes
Hash 23c1c509d07dadaf43bab0829f1c5bbabde67c809e66fd99fb637deb8fb9b54a
SimHash 6134b1501e31

Groups

*

Rule Path
Disallow /http*
Disallow *?attachment_id=
Disallow /wp-admin*
Disallow /wp-json*
Disallow /cgi-bin
Disallow *?s=
Disallow *%26s%3D
Disallow /search/
Disallow /author/
Disallow /users/
Disallow */trackback
Disallow */feed/
Disallow */rss
Disallow */embed
Disallow */wlwmanifest.xml
Disallow /xmlrpc.php
Disallow utm%3D
Disallow *openstat%3D
Allow */uploads

Other Records

Field Value
sitemap https://paperap.com/sitemap_index.xml

Warnings

  • `host` is not a known field.