geetmanjusha.com
robots.txt
Robots Exclusion Standard data for geetmanjusha.com
Resource Scan
Scan Details
Site Domain | geetmanjusha.com |
Base Domain | geetmanjusha.com |
Scan Status | Ok |
Last Scan | 2025-10-08T03:13:59+00:00 |
Next Scan | 2025-10-15T03:13:59+00:00 |
Last Scan
Scanned | 2025-10-08T03:13:59+00:00 |
URL | https://geetmanjusha.com/robots.txt |
Domain IPs | 104.21.10.188, 172.67.131.177, 2606:4700:3031::6815:abc, 2606:4700:3031::ac43:83b1 |
Response IP | 104.21.10.188 |
Found | Yes |
Hash | 496226dad2e4dd5d4cbdfc90c65968ca5556ff374490dc37b42b38a5f0604b93 |
SimHash | a96594746958 |
Groups
*
Rule | Path |
---|---|
Disallow | /assets/ |
Disallow | /cache/ |
Disallow | /components/ |
Disallow | /includes/ |
Disallow | /language/ |
Disallow | /libraries/ |
Disallow | /log/ |
Disallow | /media/ |
Disallow | /plugins/ |
Disallow | /templates/ |
Disallow | /tmp/ |
Disallow | /vendor/ |
Disallow | /hindi/movie/ |
Disallow | /hindi/singer/ |
Disallow | /hindi/musician/ |
Disallow | /hindi/lyricswriter/ |
Disallow | /hindi/actor/ |
Disallow | /marathi/movie/ |
Disallow | /marathi/singer/ |
Disallow | /marathi/musician/ |
Disallow | /marathi/lyricswriter/ |
Disallow | /marathi/actor/ |