Skip to content

Commit b07c47b

Browse files
author
Kraig Brockschmidt
authored
Merge pull request #702 from MicrosoftDocs/kraigb-rtvs
Adding Data Science workload landing page
2 parents 0025a76 + 891c9e0 commit b07c47b

7 files changed

+91
-0
lines changed

docs/rtvs/TOC.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,5 @@
11
# [R Tools for Visual Studio](index.md)
2+
## [Data science and analytical applications workload](data-science-workload.md)
23
## [Installation](installation.md)
34
## [Getting Started with R](getting-started-with-r.md)
45
### [Samples](getting-started-samples.md)

docs/rtvs/data-science-workload.md

Lines changed: 90 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,90 @@
1+
---
2+
title: Data Science and Analytical Applications Workload in Visual Studio | Microsoft Docs
3+
ms.custom: ""
4+
ms.date: 9/5/2017
5+
ms.prod: "visual-studio-dev15"
6+
ms.reviewer: ""
7+
ms.suite: ""
8+
ms.technology:
9+
- "devlang-r"
10+
- "devlang-python"
11+
- "devlang-fsharp"
12+
ms.tgt_pltfrm: ""
13+
ms.topic: "landing-page"
14+
ms.assetid: 018069f3-6d1a-4143-a851-d86d2ff5fbfc
15+
caps.latest.revision: 1
16+
author: "kraigb"
17+
ms.author: "kraigb"
18+
manager: "ghogen"
19+
---
20+
21+
# Data Science and Analytical Applications workload
22+
23+
The Data Science and Analytical Applications workload in Visual Studio brings together three languages and their respective runtime distributions:
24+
25+
- [R and Microsoft R Client](../rtvs/index.md)
26+
- [Python and Anaconda](../python/python-in-visual-studio.md)
27+
- [F# with the .NET framework](https://docs.microsoft.com/dotnet/fsharp/)
28+
29+
![Data Science and Analytics Applications workload in the Visual Studio installer](media/data-science-workload.png)
30+
31+
R and Python are two of the primary scripting languages used for data science. Both languages are easy to learn and are supported by a rich ecosystem of packages. Those packages address a wide range of scenarios such as data acquisition, cleaning, model training, deployment, and plotting. And F# is a powerful functional-first .NET language that’s suited for a wide variety of data processing tasks.
32+
33+
<!--Note link on the image because this one is large -->
34+
[![Screenshots of Visual Studio with R, Python, and F#](media/data-science-workload-screens.png)](media/data-science-workload-screens.png)
35+
36+
## Workload options
37+
38+
By default, the workload installs the following options, which you can modify in the summary section for the workload in the Visual Studio installer:
39+
40+
- F# language support
41+
- Python:
42+
- Python language support
43+
- Python web support
44+
- [Anaconda3 64-bit](https://www.continuum.io) (A Python distro that includes extensive data science libraries and a Python interpreter)
45+
- Cookiecutter template support
46+
- R:
47+
- R language support
48+
- [Microsoft R Client](https://msdn.microsoft.com/microsoft-r/r-client-get-started) (Microsoft’s fully compatible, community-supported R interpreter with ScaleR libraries for faster computation on single nodes or clusters. You can also use any R from [CRAN](https://cran.r-project.org/).)
49+
- Runtime support for R development tools
50+
51+
> [!Note]
52+
> Although F# is included with a number of other workloads and Python has a workload of its own, Data Science and Analytical Applications is the only workload at present that includes R. Independent of the workload, the three R components are also selectable on the **Individual Components** tab in the installer. Select the options **Development activities > R language support**, **Development activities > Microsoft R Client**, and **Compilers, build tools, and runtimes > Runtime support for R development tools**.
53+
54+
## SQL Server integration
55+
56+
SQL Server supports using both R and Python to do advanced analytics directly inside SQL Server. R support is included with SQL Server 2016 and later; Python support is available in SQL Server 2017 CTP 2.0 and later.
57+
58+
By running your code where your data already lives, you enjoy a number of advantages:
59+
60+
- **Elimination of data movement**: Instead of moving data from the database to your application or model, you can build R and Python applications in the database. This capability eliminates barriers of security, compliance, governance, integrity, and a host of similar issues related to moving vast amounts of data around. It also enables you to consume datasets that couldn't fit into the memory of a client machine.
61+
62+
- **Easy deployment**: Once you have the R or Python model ready, deploying it to production means simply embedding it in a T-SQL script. Any SQL client application written in any language can then take advantage of the models and intelligence through a stored procedure call. No specific R or Python integrations are necessary.
63+
64+
- **Enterprise-grade performance and scale**: You can use SQL Server’s advanced capabilities like in-memory table and column store indexes with the high-performance scalable APIs in the RevoScaleR and RevoScalePy packages. The elimination of data movement also means that you avoid client memory constraints as your data grows or you wish to increase the performance of the application.
65+
66+
- **Rich extensibility**: You can install and run any of the latest open source R or Python packages in SQL Server to build deep learning and AI applications on huge amounts of data in SQL Server. Installing a package in SQL Server is as simple as installing a package on your local machine.
67+
68+
- **Wide availability at no additional cost**: R and Python integrations are available in all editions of SQL Server 2017 and later, including the Express edition. (R support is available in SQL Server 2016 and later.)
69+
70+
To take full advantage of SQL Server integration, you should also install the **Data storage and processing** workload with the **SQL Server Data Tools** option. This option enables SQL IntelliSense, syntax highlighting, and deployment.
71+
72+
![Data storage and processing workload](media/data-storage-workload.png) &nbsp;&nbsp; &nbsp;&nbsp; ![Data storage and processing workload options](media/data-storage-workload-options.png)
73+
74+
75+
For more information:
76+
77+
- [Working with SQL Server and R](../rtvs/sql-server.md)
78+
- [In-database Advanced Analytics with R in SQL Server 2016](https://blogs.technet.microsoft.com/dataplatforminsider/2016/03/29/in-database-advanced-analytics-with-r-in-sql-server-2016/)
79+
- [Python in SQL Server 2017: enhanced in-database machine learning](https://blogs.technet.microsoft.com/dataplatforminsider/2017/04/19/python-in-sql-server-2017-enhanced-in-database-machine-learning/)
80+
81+
## Additional services and SDKs
82+
83+
In addition to what's in the Data Science and Analytics Applications workload directly, the Azure Notebooks service and the Azure SDK for Python are also helpful for data science.
84+
85+
The Azure SDK for Python makes it easy to consume and manage Microsoft Azure services from applications running on Windows, Mac OSX, and Linux. For more information, see [Azure SDK for Python](../python/azure-sdk-for-python.md)
86+
87+
Azure Notebooks (currently in preview) provides free online access to Jupyter notebooks running in the cloud on Microsoft Azure. The service includes sample notebooks in Python, R, and F# to get you started. Visit[notebooks.azure.com](https://notebooks.azure.com/).
88+
89+
<!--Note link on the image because this one is large -->
90+
[![Screenshots of Azure Notebooks with the Introduction to R sample](media/data-science-workload-notebooks.png)](media/data-science-workload-notebooks.png)
Loading
Loading
4.33 KB
Loading
Loading
4.44 KB
Loading

0 commit comments

Comments
 (0)