machiswala.com
robots.txt
Robots Exclusion Standard data for machiswala.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | machiswala.com |
| Base Domain | machiswala.com |
| Scan Status | Ok |
| Last Scan | 2025-11-23T22:47:21+00:00 |
| Next Scan | 2025-12-23T22:47:21+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-23T22:47:21+00:00 |
| URL | https://machiswala.com/robots.txt |
| Domain IPs | 104.21.27.43, 172.67.168.229, 2606:4700:3031::6815:1b2b, 2606:4700:3033::ac43:a8e5 |
| Response IP | 104.21.27.43 |
| Found | Yes |
| Hash | 06b07f6130f450fe0679eebfa39d0223142e47d6b3a7b6b5543c39639aef99cc |
| SimHash | c6354b53cd95 |
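The `Hash` value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response body; `body_digest` is a hypothetical helper and the sample body is made up, so it will not reproduce the hash in the table.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256; the
    report itself does not name the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical body, not the real file served by machiswala.com.
sample = b"User-agent: *\nAllow: /\n"
digest = body_digest(sample)
print(len(digest))  # 64 hex characters, same length as the Hash field
```

Comparing this digest between scans would show whether the file changed at all; the shorter `SimHash` presumably serves to detect near-duplicates, which a cryptographic hash cannot do.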
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
teleport
teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
ahrefsbot
semrushbot
mj12bot
webcopier
offline explorer pro
offline explorer
httrack website copier
offline commander
leech
websnake
blackwidow
http weazel
| Rule | Path |
|---|---|
| Disallow | / |
*
| Rule | Path |
|---|---|
| Disallow | /video/* |
| Disallow | /dieu-khoan.html |
| Disallow | /lien-he.html |
| Disallow | /admin/ |
| Disallow | /api/* |
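To see how these groups combine for a given crawler, here is a minimal sketch of the matching rules in RFC 9309: all groups naming the same agent are merged, the `*` groups are the fallback, the longest matching pattern wins, and `Allow` wins a tie. Everything in it is an assumption for illustration: the `GROUPS` literal is hand-copied from the tables above with the named group abbreviated to three agents, the helper names are hypothetical, and agent matching is simplified to an exact lowercase comparison rather than product-token matching against a full User-Agent string.

```python
import re

# Groups as listed in the scan; the named group is abbreviated to three agents.
GROUPS = [
    (["*"], [("allow", "/")]),
    (["ahrefsbot", "semrushbot", "mj12bot"], [("disallow", "/")]),
    (["*"], [
        ("disallow", "/video/*"),
        ("disallow", "/dieu-khoan.html"),
        ("disallow", "/lien-he.html"),
        ("disallow", "/admin/"),
        ("disallow", "/api/*"),
    ]),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a path pattern: '*' is a wildcard, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def rules_for(agent: str) -> list:
    """Merge every group naming the agent; fall back to the '*' groups."""
    agent = agent.lower()
    named = [r for agents, rules in GROUPS if agent in agents for r in rules]
    if named:
        return named
    return [r for agents, rules in GROUPS if "*" in agents for r in rules]


def is_allowed(agent: str, path: str) -> bool:
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len, allowed = -1, True
    for kind, pattern in rules_for(agent):
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


print(is_allowed("googlebot", "/admin/login"))  # False
print(is_allowed("googlebot", "/"))             # True
print(is_allowed("ahrefsbot", "/"))             # False
```

Under this reading, the two `*` groups act as one: `Allow: /` keeps the site open by default, while the longer `Disallow` patterns override it for the listed paths. Parsers that honor only the first `*` group would reach a different result.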
Other Records
| Field | Value |
|---|---|
| sitemap | https://machiswala.com/abcccc-sitemap.xml |
Warnings
- `content-signal` is not a known field.
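A warning like this typically comes from checking each field name against a list of recognized ones. The sketch below shows one way to do that; the `KNOWN_FIELDS` set is an assumption (the scanner's actual list is not given here), `unknown_fields` is a hypothetical helper, and the sample input, including the value after `Content-Signal`, is invented.

```python
# Assumption: the scanner's actual list of recognized fields is not
# given in the report; this set is illustrative only.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt: str) -> list:
    """Return unrecognized field names in order of first appearance."""
    seen = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and padding
        if ":" not in line:
            continue  # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Hypothetical input; the value after Content-Signal is made up.
sample = """User-agent: *
Content-Signal: example=yes
Allow: /
Sitemap: https://machiswala.com/abcccc-sitemap.xml
"""
print(unknown_fields(sample))  # ['content-signal']
```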
Comments