ppsinteriorsblog.com
robots.txt
Robots Exclusion Standard data for ppsinteriorsblog.com
Resource Scan
Scan Details
| Site Domain | ppsinteriorsblog.com |
| Base Domain | ppsinteriorsblog.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a server error. |
| Last Scan | 2025-12-30T19:02:44+00:00 |
| Next Scan | 2026-01-13T19:02:44+00:00 |
Last Successful Scan
| Scanned | 2025-11-22T19:02:05+00:00 |
| URL | https://ppsinteriorsblog.com/robots.txt |
| Domain IPs | 104.21.85.235, 172.67.212.24, 2606:4700:3032::ac43:d418, 2606:4700:3033::6815:55eb |
| Response IP | 104.21.85.235 |
| Found | Yes |
| Hash | 124dff9bfb2d5419aa5d1ee218c9b9e92f8e1588773d8a167a7dd71ada10b2f6 |
| SimHash | f74f7cc67c91 |
Groups
teleport
teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
ahrefsbot
semrushbot
mj12bot
webcopier
offline explorer pro
offline explorer
httrack website copier
offline commander
leech
websnake
blackwidow
http weazel
| Rule | Path |
|---|---|
| Disallow | / |
*
| Rule | Path |
|---|---|
| Disallow | /video/* |
| Disallow | /admin/ |
| Disallow | /dieu-khoan.html |
| Disallow | /lien-he.html |
| Disallow | /api/* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://ppsinteriorsblog.com/abcccc-sitemap.xml |