1 min readFeb 20, 2018
Thanks for your response,
- Yes
- p(y|X,w) is likelihood distribution and defined as Bernoulli for logistic regression, so it is nothing but the pmf of Bernoulli
- Each instance has its own probability of success and it is defined by sigmoid of linear mapping which differs from instance to instance. All instances share the same weight w and it is model parameter that we want to learn but x(t) and y(t) are depend on t and t indicates which instance we are using. So when x(t) change, probability of success also changes.