New customer prediction data

The methods documented here are to be used together with predict (transactions) to obtain the expected number of transactions of an average newly alive customer and with predict (spending) to obtain the expected spending of an average newly alive customer. This prediction is only sensible for (fictional) customers without order history: Customers which just came alive and have not had the chance to reveal any more of their behavior.

The methods described here produce the data required as input to predict(newdata=) to make this new customer prediction. This is mostly covariate data for static and dynamic covariate models. See details for the required format.

newcustomer(), newcustomer.static(), newcustomer.dynamic(): To predict the number of transactions a single, fictional, average new customer is expected to make in the num.periods periods since making the first transaction ("coming alive").

newcustomer.spending(): To estimate how much a single, fictional, average new customer is expected to spend on average per transaction.

newcustomer(num.periods)

newcustomer.static(num.periods, data.cov.life, data.cov.trans)

newcustomer.dynamic(
  num.periods,
  data.cov.life,
  data.cov.trans,
  first.transaction
)

newcustomer.spending()

Arguments

num.periods: A positive, numeric scalar indicating the number of periods to predict.
data.cov.life: Numeric-only covariate data for the lifetime process for a single customer, data.table or data.frame. See details.
data.cov.trans: Numeric-only covariate data for the transaction process for a single customer, data.table or data.frame. See details.
first.transaction: For dynamic covariate models only: The time point of the first transaction of the customer ("coming alive") for which a prediction is made. Has to be within the time range of the covariate data.

Value

newcustomer(): An object of class clv.newcustomer.no.cov
newcustomer.static(): An object of class clv.newcustomer.static.cov
newcustomer.dynamic(): An object of class clv.newcustomer.dynamic.cov
newcustomer.spending(): An object of class clv.newcustomer.spending

Details

The covariate data has to contain one column for every covariate parameter in the fitted model. Only numeric values are allowed, no factors or characters. No customer Id is required because the data on which the model was fit is not used for this prediction.

For newcustomer.static(): One column for every covariate parameter in the estimated model. No column Id. Exactly 1 row of numeric covariate data.
For example: data.frame(Gender=1, Age=30, Channel=0).

For newcustomer.dynamic(): One column for every covariate parameter in the estimated model. No column Id. A column Cov.Date with time points that mark the start of the period defined by time.unit. For every Cov.Date, exactly 1 row of numeric covariate data.
For example for weekly covariates: data.frame(Cov.Date=c("2000-01-03", "2000-01-10"), Gender=c(1,1), Channel=c(1, 1), High.Season=c(0,1,0))
If Cov.Date is of type character, the date.format given when creating the the clv.data object is used to parse it. The data has to cover the time from the customer's first transaction first.transaction to the end of the prediction period given by t. It does not have to cover the same time range as when fitting the model. See examples.

For models with dynamic covariates, the time point of the first purchase (first.transaction) is additionally required because the exact covariates that are active during the prediction period have to be known.

Examples

# \donttest{
data("apparelTrans")
data("apparelStaticCov")
data("apparelDynCov")

clv.data.apparel <- clvdata(apparelTrans, date.format = "ymd",
                            time.unit = "w", estimation.split = 52)
clv.data.static.cov <-
 SetStaticCovariates(clv.data.apparel,
                     data.cov.life = apparelStaticCov,
                     names.cov.life = "Gender",
                     data.cov.trans = apparelStaticCov,
                     names.cov.trans = c("Gender", "Channel"))
clv.data.dyn.cov <-
  SetDynamicCovariates(clv.data = clv.data.apparel,
                       data.cov.life = apparelDynCov,
                       data.cov.trans = apparelDynCov,
                       names.cov.life = c("High.Season", "Gender"),
                       names.cov.trans = c("High.Season", "Gender"),
                       name.date = "Cov.Date")



# No covariate model
p.apparel <- pnbd(clv.data.apparel)
#> Starting estimation...
#> Estimation finished!

# Predict the number of transactions an average new
# customer is expected to make in the first 3.68 weeks
predict(
  p.apparel,
  newdata=newcustomer(num.periods=3.68)
)
#> [1] 0.1026112


# Spending model
gg.apparel <- gg(clv.data.apparel)
#> Starting estimation...
#> Estimation finished!
predict(gg.apparel, newdata = newcustomer.spending())
#> [1] 37.12434



# Static covariate model
p.apparel.static <- pnbd(clv.data.static.cov)
#> Starting estimation...
#> Estimation finished!

# Predict the number of transactions an average new
# customer who is female (Gender=1) and who was acquired
# online (Channel=1) is expected to make in the first 3.68 weeks
predict(
  p.apparel.static,
  newdata=newcustomer.static(
    num.periods=3.68,
    # For the lifetime process, only Gender was used when fitting
    data.cov.life=data.frame(Gender=1),
    data.cov.trans=data.frame(Gender=1, Channel=0)
  )
)
#> [1] 0.1028615


if (FALSE) { # \dontrun{
# Dynamic covariate model

p.apparel.dyn <- pnbd(clv.data.dyn.cov)

# Predict the number of transactions an average new
# customer who is male (Gender=0), who did not purchase during
# high.season, and who was
# acquired on "2005-02-16" (first.transaction) is expected
# to make in the first 2.12 weeks.
# Note that the time range is very different from the one used
# when fitting the model. Cov.Date still has to match the
# beginning of the week.
predict(
  p.apparel.dyn,
  newdata=newcustomer.dynamic(
    num.periods=2.12,
    data.cov.life=data.frame(
      Cov.Date=c("2051-02-12", "2051-02-19", "2051-02-26"),
      Gender=c(0, 0, 0),
      High.Season=c(4, 0, 7)),
    data.cov.trans=data.frame(
      Cov.Date=c("2051-02-12", "2051-02-19", "2051-02-26"),
      Gender=c(0, 0, 0),
      High.Season=c(4, 0, 7)),
    first.transaction = "2051-02-16"
  )
)
} # }
# }

Arguments

Value

Details

See also

Examples