jaruwa.com
robots.txt

Robots Exclusion Standard data for jaruwa.com

Resource Scan

Scan Details

Site Domain jaruwa.com
Base Domain jaruwa.com
Scan Status Ok
Last Scan2024-04-25T11:46:17+00:00
Next Scan 2024-05-25T11:46:17+00:00

Last Scan

Scanned2024-04-25T11:46:17+00:00
URL https://jaruwa.com/robots.txt
Domain IPs 104.21.71.89, 172.67.170.76, 2606:4700:3033::ac43:aa4c, 2606:4700:3035::6815:4759
Response IP 104.21.71.89
Found Yes
Hash 9ff819a24f4f60a0fa45c5e15e0d533974de4b5d22011466be236058bcbe9d6e
SimHash b80ad4cf2d39

Groups

*

Rule Path
Disallow /register
Disallow /local
Disallow /assets
Disallow /data
Disallow /full
Disallow /123456
Disallow /client

*

Rule Path
Allow /who-we-are
Allow /who-we-are/about-us
Allow /who-we-are/meet-our-team
Allow /services
Allow /blog
Allow /gallery
Allow /privacy-and-policy
Allow /contact-us

Other Records

Field Value
sitemap https://www.jaruwa.com/sitemap.xml