buildingaaya.com
robots.txt

Robots Exclusion Standard data for buildingaaya.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	buildingaaya.com
Base Domain	buildingaaya.com
Scan Status	Ok
Last Scan	2025-12-30T14:31:49+00:00
Next Scan	2026-01-06T14:31:49+00:00

Last Scan

Scanned	2025-12-30T14:31:49+00:00
URL	https://buildingaaya.com/robots.txt
Redirect	https://www.buildingaaya.com/robots.txt
Redirect Domain	www.buildingaaya.com
Redirect Base	buildingaaya.com
Domain IPs	147.93.109.176, 2a02:4780:11:1975:0:11c9:7a96:2
Redirect IPs	147.93.109.176, 2a02:4780:11:1975:0:11c9:7a96:2
Response IP	147.93.109.176
Found	Yes
Hash	26c7ed3bd3bb0ee1e0fd68571dbe83aac968a5c73190c8abe8a126cdf8ea7fa4
SimHash	485d7bd003f9

Groups

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

scrapy

Rule	Path
Disallow	/

Rule

Path

Disallow

wget

Rule	Path
Disallow	/

Rule

Path

Disallow

octoparse

Rule	Path
Disallow	/

Rule

Path

Disallow

parsehub

Rule	Path
Disallow	/

Rule

Path

Disallow

webharvy

Rule	Path
Disallow	/

Rule

Path

Disallow

contentgrabber

Rule	Path
Disallow	/

Rule

Path

Disallow

diffbot

Rule	Path
Disallow	/

Rule

Path

Disallow

apify

Rule	Path
Disallow	/

Rule

Path

Disallow

spinn3r

Rule	Path
Disallow	/

Rule

Path

Disallow

dataminer

Rule	Path
Disallow	/

Rule

Path

Disallow

import.io

Rule	Path
Disallow	/

Rule

Path

Disallow

zyte

Rule	Path
Disallow	/

Rule

Path

Disallow

simplescraper

Rule	Path
Disallow	/

Rule

Path

Disallow

fminer

Rule	Path
Disallow	/

Rule

Path

Disallow

scraperapi

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/?beautifulsoup
Disallow	/?selenium
Disallow	/?puppeteer
Disallow	/?cheerio
Disallow	/?webscraper
Disallow	/?instantdatascraper

Rule

Path

Disallow

/*?beautifulsoup*

Disallow

/*?selenium*

Disallow

/*?puppeteer*

Disallow

/*?cheerio*

Disallow

/*?webscraper*

Disallow

/*?instantdatascraper*

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

*

Rule	Path
Allow	/
Disallow	/my/
Disallow	/private/
Disallow	/admin/
Disallow	/*.pdf$

Rule

Path

Allow

Disallow

/my/

Disallow

/private/

Disallow

/admin/

Disallow

/*.pdf$

Other Records

Field	Value
sitemap	https://www.buildingaaya.com/sitemap.xml

Field

Value

sitemap

https://www.buildingaaya.com/sitemap.xml

Comments

Block specific scraping tools
Block query string scrapers with wildcards
General rules for all other bots (including Google)

buildingaaya.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

httrack

scrapy

wget

octoparse

parsehub

webharvy

contentgrabber

diffbot

apify

spinn3r

dataminer

import.io

zyte

simplescraper

fminer

scraperapi

*

googlebot

*

Other Records

Comments

buildingaaya.com
robots.txt