groupia.com
robots.txt

Robots Exclusion Standard data for groupia.com

Resource Scan

Scan Details

Site Domain groupia.com
Base Domain groupia.com
Scan Status Ok
Last Scan 2025-07-04T13:53:35+00:00
Next Scan 2025-07-11T13:53:35+00:00

Last Scan

Scanned 2025-07-04T13:53:35+00:00
URL https://groupia.com/robots.txt
Redirect https://www.groupia.com/robots.txt
Redirect Domain www.groupia.com
Redirect Base groupia.com
Domain IPs 217.160.0.122
Redirect IPs 217.160.0.122
Response IP 217.160.0.122
Found Yes
Hash 56358c9357e5f3fbbd89805b8796e7f97623cea193dbcab6a9c410ee3879fe99
SimHash 603c916147d8

Groups

*

Rule Path
Disallow /privacy-protection.php
Disallow /terms.php
Disallow /faq-claims.php
Disallow /email/
Disallow /testing/
Disallow /suppliers/suppliers-accommodation
Disallow /suppliers/suppliers-activities
Disallow /suppliers/suppliers-nightlife
Disallow /suppliers/suppliers-restaurant
Disallow /suppliers/suppliers-selfcatering
Disallow /suppliers/suppliers-transport
Disallow /enquire/
Disallow /property/
Disallow /adventure-weekends/package-results.php
Disallow /adventure-weekends/package.php
Disallow /favourites/results
Disallow /ota-terms.php
Disallow /partners

Other Records

Field Value
sitemap https://www.groupia.com/sitemap.xml
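To show how a crawler might apply the group rules and sitemap record listed above, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is rebuilt from the scanned rule list rather than fetched from the site, so its exact layout is an assumption. The user agent name "ExampleBot", the helper is_allowed, and the sample URLs are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the "*" group in the scan record.
DISALLOWED_PATHS = [
    "/privacy-protection.php",
    "/terms.php",
    "/faq-claims.php",
    "/email/",
    "/testing/",
    "/suppliers/suppliers-accommodation",
    "/suppliers/suppliers-activities",
    "/suppliers/suppliers-nightlife",
    "/suppliers/suppliers-restaurant",
    "/suppliers/suppliers-selfcatering",
    "/suppliers/suppliers-transport",
    "/enquire/",
    "/property/",
    "/adventure-weekends/package-results.php",
    "/adventure-weekends/package.php",
    "/favourites/results",
    "/ota-terms.php",
    "/partners",
]

# Rebuild an equivalent robots.txt (assumed layout, not the original file).
robots_lines = ["User-agent: *"]
robots_lines += [f"Disallow: {path}" for path in DISALLOWED_PATHS]
robots_lines.append("Sitemap: https://www.groupia.com/sitemap.xml")

parser = RobotFileParser()
parser.parse(robots_lines)


def is_allowed(url: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the rebuilt rules permit user_agent to fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical URLs chosen to exercise the rules.
    for url in (
        "https://www.groupia.com/",
        "https://www.groupia.com/email/welcome.html",
        "https://www.groupia.com/partners",
        "https://www.groupia.com/suppliers/",
    ):
        print(url, is_allowed(url))
    print(parser.site_maps())
```

urllib.robotparser matches rules by path prefix, so a rule such as /partners also covers any longer path that begins with the same characters, while /suppliers/ itself stays fetchable because only the listed suppliers-* paths are disallowed.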

Comments

  • list folders robots are not allowed to index
  • list specific files robots are not allowed to index