commonfloor.com
robots.txt

Robots Exclusion Standard data for commonfloor.com

Resource Scan

Scan Details

Site Domain commonfloor.com
Base Domain commonfloor.com
Scan Status Ok
Last Scan2024-11-04T22:42:32+00:00
Next Scan 2024-11-11T22:42:32+00:00

Last Scan

Scanned2024-11-04T22:42:32+00:00
URL https://commonfloor.com/robots.txt
Redirect https://www.commonfloor.com/robots.txt
Redirect Domain www.commonfloor.com
Redirect Base commonfloor.com
Domain IPs 23.36.49.238
Redirect IPs 23.36.49.238
Response IP 23.54.58.82
Found Yes
Hash 677746b7da4d7f3909b2325a4b73808b42ca8040279a0251ee7731362ef9eff6
SimHash 927f891583d3

Groups

*

Rule Path
Disallow /6225870/
Disallow /_s/
Disallow /nm/
Disallow /sms-service*
Disallow /apartments-for-sale?search_intent*
Disallow /apartments-for-rent?search_intent*
Disallow /index/search*
Disallow /for_sale*
Disallow /for_rent*
Disallow /property-search*
Disallow /listing-search*
Disallow /project-search*
Disallow /sem*
Disallow /weekend-guide*
Disallow /channel*
Disallow /agent/*/cfap-*/listings-sale
Disallow /agent/*/cfap-*/listings-rent
Disallow /location-listing/*
Disallow /property-listing-public/*
Disallow /authorize/
Disallow /freetext-search?*
Disallow /search?*

psbot/0.1

Rule Path
Disallow /

twiceler www.cuill.com/robots.html

Rule Path
Disallow /

twiceler-0.9 http://www.cuill.com/twiceler/robot.html

Rule Path
Disallow /

dwaarbot+(dwaarbot@dwaar.com)

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

giga�mega.bot/1.0; +http://www.giga�mega.net/bot.html

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

baidu

Rule Path
Disallow /

speedy

Rule Path
Disallow /

iiitbot/1.1 (indian language web search engine; http://webkhoj.iiit.net; pvvpr at iiit dot ac dot in)

Rule Path
Disallow /

boitho.com-dc

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

seekbot

Rule Path
Disallow /

pete-spider light

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.commonfloor.com/sitemap/sitemap_index.xml