getaroom.com
robots.txt

Robots Exclusion Standard data for getaroom.com

Resource Scan

Scan Details

Site Domain	getaroom.com
Base Domain	getaroom.com
Scan Status	Ok
Last Scan	2024-06-09T10:04:31+00:00
Next Scan	2024-07-09T10:04:31+00:00

Last Scan

Scanned	2024-06-09T10:04:31+00:00
URL	https://getaroom.com/robots.txt
Redirect	https://www.getaroom.com/robots.txt
Redirect Domain	www.getaroom.com
Redirect Base	getaroom.com
Domain IPs	34.200.118.55, 44.206.96.55, 44.218.201.61
Redirect IPs	34.200.118.55, 44.206.96.55, 44.218.201.61
Response IP	44.218.201.61
Found	Yes
Hash	7d27e981c8bace04f1920dfb97667f5ca78437320ede5a8f5c626b77563f8815
SimHash	a2cd0d456555

Groups

*

Rule	Path
Disallow	/reservations/*
Disallow	/av/
Allow	/reservations/$
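
The three rules in the `*` group overlap for the path `/reservations/`: it matches both `Disallow /reservations/*` and `Allow /reservations/$`. A minimal sketch of how a crawler following RFC 9309 might resolve this is shown here, assuming longest-pattern precedence with ties going to `Allow`. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan report; real crawlers may resolve conflicts differently.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("disallow", "/reservations/*"),
    ("disallow", "/av/"),
    ("allow", "/reservations/$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    Assumption: the longest matching pattern wins, and an Allow rule
    wins a tie against a Disallow rule of equal length.
    """
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

Under these assumptions, `is_allowed("/reservations/")` is true because the two matching patterns have equal length and the `Allow` rule wins the tie, while `is_allowed("/reservations/12345")` and `is_allowed("/av/banner")` are false. Paths that match no rule, such as `/hotels`, are allowed.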

yahoo!-adcrawler

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	2

Other Records

Field	Value
sitemap	https://www.getaroom.com/sitemaps.xml
sitemap	https://m.getaroom.com/sitemaps.xml
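
The crawl-delay and sitemap records listed above can be read programmatically with Python's standard `urllib.robotparser`. The sketch below parses a hypothetical reconstruction of the file; the `ROBOTS_TXT` text is an assumption pieced together from this report (including the placement of `Crawl-delay` under the `yahoo!-adcrawler` group), and the live file's ordering, casing, and comments may differ. `ExampleBot` is a made-up user agent. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from the scan report, not the verbatim file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /reservations/*
Disallow: /av/
Allow: /reservations/$

User-agent: yahoo!-adcrawler
Crawl-delay: 2

Sitemap: https://www.getaroom.com/sitemaps.xml
Sitemap: https://m.getaroom.com/sitemaps.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Delay in seconds for the group naming this agent, or None if unset.
yahoo_delay = parser.crawl_delay("yahoo!-adcrawler")

# Agents without their own group fall back to the "*" group, which
# declares no crawl-delay in this reconstruction.
other_delay = parser.crawl_delay("ExampleBot")

# Sitemap URLs in file order.
sitemaps = parser.site_maps()
```

Note that `urllib.robotparser` does not interpret the `*` and `$` wildcards in paths, so its `can_fetch()` results for the `/reservations/` rules should not be relied on here.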

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-Agent: *
Disallow: /