434579.app.netsuite.com
robots.txt

Robots Exclusion Standard data for 434579.app.netsuite.com

Resource Scan

Scan Details

Site Domain	434579.app.netsuite.com
Base Domain	netsuite.com
Scan Status	Ok
Last Scan	2024-11-05T07:35:30+00:00
Next Scan	2024-11-19T07:35:30+00:00

Last Scan

Scanned	2024-11-05T07:35:30+00:00
URL	https://434579.app.netsuite.com/robots.txt
Domain IPs	23.73.12.153
Response IP	23.207.180.161
Found	Yes
Hash	7f2607efb70f98b41a00b1836bcf3b3f3dcf2f6ab55673c167f944b3b4f339a4
SimHash	181521c749d1

Groups

googlebot

Rule	Path
Disallow	/site/
Disallow	/catalog
Disallow	/campaigns
Disallow	/web/thanks
Disallow	/web/__MACOSX
Disallow	/mdr4.html
Disallow	/mdr3.html
Disallow	/licenseOLD.html
Disallow	/license.html
Disallow	/keepalive.html
Disallow	/blank.html
Disallow	/732.HTML
Disallow	/729.html
Disallow	/728.HTML
Disallow	/726.HTML
Disallow	/725.HTML
Disallow	/724.html
Disallow	/723.html
Disallow	/722.html

googlebot-image

Rule	Path
Disallow	(empty)

adsbot-google

Rule	Path
Disallow	(empty)

*

Rule	Path
Disallow	/site/
Disallow	/catalog
Disallow	/campaigns
Disallow	/web/css
Disallow	/web/js
Disallow	/web/thanks
Disallow	/web/__MACOSX
Disallow	/web/images
Disallow	/mdr4.html
Disallow	/mdr3.html
Disallow	/licenseOLD.html
Disallow	/license.html
Disallow	/keepalive.html
Disallow	/blank.html
Disallow	/732.HTML
Disallow	/729.html
Disallow	/728.HTML
Disallow	/726.HTML
Disallow	/725.HTML
Disallow	/724.html
Disallow	/723.html
Disallow	/722.html
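
The groups above can be checked against concrete URLs. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a hypothetical partial reconstruction from the tables above (the raw file, its group order, and its full rule list are not shown in this scan), and `is_allowed` and `examplebot` are illustrative names, not part of the scanned site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical partial reconstruction from the "Groups" tables above;
# the real file's ordering and complete rule list may differ.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow: /site/
Disallow: /catalog
Disallow: /web/thanks

User-agent: *
Disallow: /site/
Disallow: /catalog
Disallow: /web/css
Disallow: /web/js
Disallow: /web/images
"""

BASE = "https://434579.app.netsuite.com"


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the sketch rules."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent in ("googlebot", "examplebot"):
        for path in ("/web/css/site.css", "/site/page", "/index.html"):
            print(agent, path, is_allowed(agent, path))
```

In this sketch the `googlebot` group omits `/web/css`, `/web/js`, and `/web/images`, so those paths stay fetchable for googlebot while the `*` group blocks them for other crawlers. Note that `urllib.robotparser` matches agent names by substring, so its result for an agent such as `googlebot-image` may not reflect how Google's own crawlers select a group.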

Comments

This robot.txt file is for http://lablearning.com
Each line in the group has the format
<field>:<value>
The purpose is to hopefully "control" the robots/spiders
from indexing directories that have "confidential" data.
Which robot is allowed and where
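
The `<field>:<value>` line format described in these comments can be illustrated with a small parser. This is only a sketch under simplifying assumptions: `parse_groups` is a hypothetical helper, it treats `#` as starting a comment, and it does not handle every detail of the Robots Exclusion Protocol.

```python
def parse_groups(text: str) -> dict[str, list[tuple[str, str]]]:
    """Group <field>:<value> rules by the user-agent lines that precede them."""
    groups: dict[str, list[tuple[str, str]]] = {}
    current_agents: list[str] = []
    last_was_agent = False
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field, value = line.split(":", 1)
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            # Consecutive user-agent lines share one group of rules.
            if not last_was_agent:
                current_agents = []
            current_agents.append(value.lower())
            groups.setdefault(value.lower(), [])
            last_was_agent = True
        else:
            for agent in current_agents:
                groups[agent].append((field, value))
            last_was_agent = False
    return groups


if __name__ == "__main__":
    sample = (
        "# sample input, not the scanned file\n"
        "User-agent: googlebot-image\n"
        "Disallow:\n"
        "\n"
        "User-agent: *\n"
        "Disallow: /site/\n"
        "Disallow: /catalog\n"
    )
    print(parse_groups(sample))
```

An empty `Disallow` value, as recorded for the googlebot-image and adsbot-google groups, is kept here as an empty string; under the protocol it disallows nothing.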
