| glm {SparkR} | R Documentation | 
Fits a generalized linear model, similarly to R's glm().
glm(formula, family = gaussian, data, weights, subset, na.action, start = NULL, etastart, mustart, offset, control = list(...), model = TRUE, method = "glm.fit", x = FALSE, y = TRUE, contrasts = NULL, ...) ## S4 method for signature 'formula,ANY,SparkDataFrame' glm(formula, family = gaussian, data, epsilon = 1e-06, maxit = 25)
formula | 
 a symbolic description of the model to be fitted. Currently only a few formula operators are supported, including '~', '.', ':', '+', and '-'.  | 
family | 
 a description of the error distribution and link function to be used in the model. This can be a character string naming a family function, a family function or the result of a call to a family function. Refer R family at https://stat.ethz.ch/R-manual/R-devel/library/stats/html/family.html.  | 
data | 
 a SparkDataFrame or R's glm data for training.  | 
weights | 
 an optional vector of ‘prior weights’ to be used
in the fitting process.  Should be   | 
subset | 
 an optional vector specifying a subset of observations to be used in the fitting process.  | 
na.action | 
 a function which indicates what should happen
when the data contain   | 
start | 
 starting values for the parameters in the linear predictor.  | 
etastart | 
 starting values for the linear predictor.  | 
mustart | 
 starting values for the vector of means.  | 
offset | 
 this can be used to specify an a priori known
component to be included in the linear predictor during fitting.
This should be   | 
control | 
 a list of parameters for controlling the fitting
process.  For   | 
model | 
 a logical value indicating whether model frame should be included as a component of the returned value.  | 
method | 
 the method to be used in fitting the model.  The default
method  User-supplied fitting functions can be supplied either as a function
or a character string naming a function, with a function which takes
the same arguments as   | 
x,y | 
 For   | 
contrasts | 
 an optional list. See the   | 
... | 
 For  For   | 
epsilon | 
 positive convergence tolerance of iterations.  | 
maxit | 
 integer giving the maximal number of IRLS iterations.  | 
glm returns a fitted generalized linear model.
glm since 1.5.0
## Not run: 
##D sparkR.session()
##D data(iris)
##D df <- createDataFrame(iris)
##D model <- glm(Sepal_Length ~ Sepal_Width, df, family = "gaussian")
##D summary(model)
## End(Not run)