buildingaaya.com
robots.txt

Robots Exclusion Standard data for buildingaaya.com

Resource Scan

Scan Details

Site Domain buildingaaya.com
Base Domain buildingaaya.com
Scan Status Ok
Last Scan 2025-05-21T09:19:10+00:00
Next Scan 2025-05-28T09:19:10+00:00

Last Scan

Scanned 2025-05-21T09:19:10+00:00
URL https://buildingaaya.com/robots.txt
Redirect https://www.buildingaaya.com/robots.txt
Redirect Domain www.buildingaaya.com
Redirect Base buildingaaya.com
Domain IPs 147.93.109.176, 2a02:4780:11:1975:0:11c9:7a96:2
Redirect IPs 147.93.109.176, 2a02:4780:11:1975:0:11c9:7a96:2
Response IP 147.93.109.176
Found Yes
Hash 26c7ed3bd3bb0ee1e0fd68571dbe83aac968a5c73190c8abe8a126cdf8ea7fa4
SimHash 485d7bd003f9

Groups

httrack

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

wget

Rule Path
Disallow /

octoparse

Rule Path
Disallow /

parsehub

Rule Path
Disallow /

webharvy

Rule Path
Disallow /

contentgrabber

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

apify

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

dataminer

Rule Path
Disallow /

import.io

Rule Path
Disallow /

zyte

Rule Path
Disallow /

simplescraper

Rule Path
Disallow /

fminer

Rule Path
Disallow /

scraperapi

Rule Path
Disallow /

*

Rule Path
Disallow /*?beautifulsoup*
Disallow /*?selenium*
Disallow /*?puppeteer*
Disallow /*?cheerio*
Disallow /*?webscraper*
Disallow /*?instantdatascraper*

googlebot

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /my/
Disallow /private/
Disallow /admin/
Disallow /*.pdf$
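
The wildcard paths in the two "*" groups above ("/*?selenium*", "/*.pdf$") may be easier to read with a small matcher. What follows is a minimal sketch, not the logic of any particular crawler. It assumes the RFC 9309 conventions: "*" matches any run of characters, a trailing "$" anchors the end of the URL, the two "*" groups are merged into one rule list, the longest matching rule wins, and Allow wins a tie. The names RULES, rule_to_regex and is_allowed are hypothetical helpers introduced for illustration.

```python
import re

# Rules copied from the two "*" groups of the scan, merged into one list
# (merging same-agent groups is an assumption based on RFC 9309).
RULES = [
    ("disallow", "/*?beautifulsoup*"),
    ("disallow", "/*?selenium*"),
    ("disallow", "/*?puppeteer*"),
    ("disallow", "/*?cheerio*"),
    ("disallow", "/*?webscraper*"),
    ("disallow", "/*?instantdatascraper*"),
    ("allow", "/"),
    ("disallow", "/my/"),
    ("disallow", "/private/"),
    ("disallow", "/admin/"),
    ("disallow", "/*.pdf$"),
]


def rule_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape the literal pieces and join them with ".*" where "*" stood.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path (path plus query string) may be fetched."""
    best_len, best_kind = -1, "allow"  # no matching rule means allowed
    for kind, path in rules:
        if rule_to_regex(path).match(url_path):
            length = len(path)
            # Longest pattern wins; on equal length, prefer allow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, best_kind = length, kind
    return best_kind == "allow"


if __name__ == "__main__":
    for candidate in ["/", "/private/data.html", "/docs/brochure.pdf",
                      "/page?selenium=1", "/page?id=1&selenium=1"]:
        print(candidate, is_allowed(candidate))
```

Under these assumptions, "/page?selenium=1" is blocked but "/page?id=1&selenium=1" is not, because the pattern requires the term to follow the "?" directly. Likewise "/*.pdf$" stops matching once a query string is appended to the PDF URL.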

Other Records

Field Value
sitemap https://www.buildingaaya.com/sitemap.xml

Comments

  • Block specific scraping tools
  • Block query string scrapers with wildcards
  • General rules for all other bots (including Google)
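
The per-tool groups listed above can be checked with Python's standard urllib.robotparser module. This is a hedged sketch: ROBOTS_EXCERPT is a reconstruction of three of the groups from the scan (the exact text, casing and ordering of the original file are assumed), and can_fetch is a hypothetical wrapper. The example is limited to plain prefix rules because urllib.robotparser does not implement "*" or "$" wildcards in paths.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt; the real robots.txt may differ in layout.
ROBOTS_EXCERPT = """\
User-agent: wget
Disallow: /

User-agent: scrapy
Disallow: /

User-agent: googlebot
Allow: /
"""


def can_fetch(agent, url, robots_text=ROBOTS_EXCERPT):
    """Parse the excerpt and ask whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    home = "https://www.buildingaaya.com/"
    for agent in ["Wget/1.21", "Scrapy/2.11", "Googlebot", "SomeOtherBot"]:
        print(agent, can_fetch(agent, home))
```

The parser compares only the product token before the first "/" in the agent string, case-insensitively, so "Wget/1.21" falls under the "wget" group. An agent with no matching group in this excerpt is treated as allowed; in the full file it would fall through to the "*" groups instead.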