nishari.com
robots.txt
Robots Exclusion Standard data for nishari.com
Resource Scan
Scan Details
| Site Domain | nishari.com |
| Base Domain | nishari.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a server error. |
| Last Scan | 2025-11-20T07:33:59+00:00 |
| Next Scan | 2025-12-20T07:33:59+00:00 |
Last Successful Scan
| Scanned | 2025-10-21T05:19:26+00:00 |
| URL | https://nishari.com/robots.txt |
| Domain IPs | 104.21.39.16, 172.67.142.31, 2606:4700:3031::6815:2710, 2606:4700:3037::ac43:8e1f |
| Response IP | 104.21.39.16 |
| Found | Yes |
| Hash | 217927ba7a35b48cb4df1128f43d4a78753415d802a2f2c1fc988286e84eeaf2 |
| SimHash | f64f7c867c11 |
Groups
teleport
teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
ahrefsbot
semrushbot
mj12bot
webcopier
offline explorer pro
offline explorer
httrack website copier
offline commander
leech
websnake
blackwidow
http weazel
| Rule | Path |
|---|---|
| Disallow | / |
*
| Rule | Path |
|---|---|
| Disallow | /video/* |
| Disallow | /admin/ |
| Disallow | /dieu-khoan.html |
| Disallow | /lien-he.html |
| Disallow | /api/* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://nishari.com/abcccc-sitemap.xml |