generalsync.com
robots.txt

Robots Exclusion Standard data for generalsync.com

Resource Scan

Scan Details

Site Domain generalsync.com
Base Domain generalsync.com
Scan Status Ok
Last Scan2025-11-22T19:10:17+00:00
Next Scan 2025-12-22T19:10:17+00:00

Last Scan

Scanned2025-11-22T19:10:17+00:00
URL https://generalsync.com/robots.txt
Domain IPs 144.76.63.4
Response IP 144.76.63.4
Found Yes
Hash ce5db44cd2333b18aab19c2092254b5692b2ad3d017c84de621767712c1d48f8
SimHash e139974bce89

Groups

*

Rule Path
Disallow

adsbot-google*
googlebot*

Rule Path
Disallow /res/homepage/nocloud.png
Disallow /res/homepage/team.png

Other Records

Field Value
sitemap https://generalsync.com/sitemap.xml
sitemap https://generalsync.com/sitemap.xml

Comments

  • This domain serves mainly static content, and thus should be indexed in full
  • Exception: large background images somehow mess up Google's algorithms, so
  • we spare the bot the confusion as they're not really important anyway.