weblineindia.com
robots.txt
Robots Exclusion Standard data for weblineindia.com
Resource Scan
Scan Details
Site Domain | weblineindia.com |
Base Domain | weblineindia.com |
Scan Status | Ok |
Last Scan | 2024-05-21T15:05:18+00:00 |
Next Scan | 2024-06-20T15:05:18+00:00 |
Last Scan
Scanned | 2024-05-21T15:05:18+00:00 |
URL | https://weblineindia.com/robots.txt |
Redirect | https://www.weblineindia.com/robots.txt |
Redirect Domain | www.weblineindia.com |
Redirect Base | weblineindia.com |
Domain IPs | 13.33.88.18, 13.33.88.27, 13.33.88.30, 13.33.88.32 |
Redirect IPs | 13.33.88.18, 13.33.88.27, 13.33.88.30, 13.33.88.32 |
Response IP | 13.33.88.18 |
Found | Yes |
Hash | f06b6e0cd64f5fd1f3cd18f73bfb01d49f0d1c3f72a294972bba66f2a7b1bf14 |
SimHash | e615dc40c492 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /promo/ |
Disallow | /development/ |
Disallow | /offshore/ |
Disallow | /design/ |
Disallow | /signature_img/ |
Disallow | /email-campaign/ |
Disallow | /business-trip/ |
Disallow | /OLD_PAGES/ |
Disallow | /landingpages/ |
Disallow | /apps-privacy-policy/ |
Disallow | /titbits/ |
Disallow | /common-tech/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.weblineindia.com/sitemap.xml |