Bayesian workflow with PyMC3 and ArviZ

1.
Bayesian Workflow with PyMCand ArviZ Corrie Bartelheimer Data Scientist at Europace AG @corrieaar

2.
First, the problem

3.
First, the problem

4.
First, the problem

5.
First, the problem

6.
First, the problem

7.
First, the problem

8.
First, the problem

9.
First, the problem

10.
Solution: Hierarchical Bayesian Model

11.
Solution: Hierarchical BayesianModel

12.

13.

14.

15.

16.
Getting this intoPython

17.
import pymc3 aspm with pm.Model() as lin_model: α = pm.Normal("α", 0, 100) β = pm.Normal("β", 0, 100) σ = pm.Exponential("σ", 1/100) μ = α + β*d["area"] y = pm.Normal("y", μ, σ, observed=d["price"]) Getting this into Python: PyMC3

18.

19.

20.
Getting this intoPython: PyMC3 import pymc3 as pm with pm.Model() as lin_model: α = pm.Normal("α", 0, 100) β = pm.Normal("β", 0, 100) σ = pm.Exponential("σ", 1/100) μ = α + β*d["area"] y = pm.Normal("y", μ, σ, observed=d["price"])

21.
with pm.Model() ashier_model: μ_α = pm.Normal("μ_α", 0, 100) μ_β = ... σ = pm.Exponential("σ", 1/100) σ_α = σ_β = ... α = pm.Normal("α", μ_α, σ_α, shape=num_zip) β = pm.Normal("β", μ_β, σ_β, shape=num_zip) μ = α[d["zip"]] + β[d["zip"]]*d["area"] y = pm.Normal("y", μ, σ, observed=d["price"]) Getting this into Python: PyMC3

22.

23.

24.

25.
What about thepriors?

26.

27.
What about thepriors? with model: prior = pm.sample_prior_predictive()

28.
with model: prior =pm.sample_prior_predictive() What about the priors?

29.

30.

31.

32.

33.

34.

35.

36.
with pm.Model() ashier_model: μ_α = pm.Normal("μ_α", 0, 20) μ_β = pm.Normal("μ_β", 0, 5) σ = pm.Exponential("σ", 1/5) σ_α = σ_β = ... α = pm.Normal("α", μ_α, σ_α, shape=num_zip) β = pm.Normal("β", μ_β, σ_β, shape=num_zip) μ = α[d["zip"]] + β[d["zip"]]*d["area"] y = pm.Normal("y", μ, σ, observed=d["price"]) trace = pm.sample() What about the priors?

37.
Did it converge?

38.
Did it converge? importarviz as az az.plot_trace(trace)

39.
Did it converge?

40.
Did it converge? SomeBad Examples

41.

42.

43.

44.
Did it converge? az.summary(trace)

45.

46.

47.

48.
Did it converge? Rhatstatistic smaller 1.05? Effective sample size / iterations greater 10%? Monte Carlo se / posterior sd smaller 10%?

49.
How good doesmy model fit the data?

50.
How good doesmy model fit the data? with hier_model: posterior_predictive = pm.sample_posterior_predictive(trace)

51.

52.

53.

54.
Results, please!

55.
Results, please!

56.
Results, please!

57.
Results, please!

58.
Results, please!

59.
What’s next?

60.
What’s next? ● Iterate! ●More predictors! ○ Year of construction ○ House type ○ ... ● More hierarchies! ● Add group predictors! ○ Percentage of green areas ○ Economical indices ● Try different likelihoods ● Probably save more money...

61.
Further resources RichardMcElreath: Statistical Rethinking - Port to PyMC3 Prior Recommendation by Stan Team Michael Betancourts Case Studies BerlinBayesians Icons by icons8

62.
Thanks! @corrieaar corriebar Code and Notebooks www.samples-of-thoughts.com Iconsby icons8

Bayesian workflow with PyMC3 and ArviZ

In this document