I am new here, hope to help and be helped.
However, I am working on the new Microsoft Sql Server Management Studio (2016), using its new features that imply the integration with R. First of all, my goal is to create a stored procedure that perform a K-Means clustering with x and y column.
The problem is that I am stuck in the middle, because I am not able to decline the online documentation to my case.
Here the script
CREATE TABLE [dbo].[ModelTable]
(
column_name1 varchar(8000)
)
;
CREATE TABLE [dbo].[ResultTable]
(
column_name1 varchar(8000),
column_name2 varchar(8000),
column_name3 varchar(8000),
column_name4 varchar(8000)
)
;
CREATE PROCEDURE [dbo].[kmean]
AS
BEGIN
DECLARE @inquery nvarchar(max) = N'
select name,x,y FROM [dbtable]
'
-- then I decide to insert the model in a table: this is similar to the documentation, but I am not sure it fits well.
INSERT INTO [dbo].[ModelTable]
EXEC sp_execute_external_script @language = N'R',
@script = N'
## Here I create model: this is one of the biggest problem, because I tried to create a data frame with the data, but I do not know if here,
## in the R code, the data are read in this way. Generally in "pure" R, I write data.frame(sourcedata$x,sourcedata$y), but here, where is source of data?
## In the documentation it is used ImputDataSet, so maybe I could do:
trained_model <- kmeans(data.frame(ImputDataSet$x,ImputDataSet$y),8)
-- If everything is ok (doubtfully) I should have the model. And here, the part that I really cannot handle.
-- I'd like to have a table [ResultTable] with name, variable x, variable y, and trainedmodel$cluster.
',
@input_data_1 = @inquery,
@output_data_1_name = N'trained_model'
;
END
GO
EXEC kmean
Well there many problems and so on and, due the fact that this is a quite brand new feature in MSSMS, there are not a great quantity of help etc. in the Internet. Thanks in advance