Table 7 Optimal combination of treatments for a variety of states at Stage 2

From: Comparative effectiveness research on patients with acute ischemic stroke using Markov decision processes

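Columns i1 to i6 are the states at t2; columns a1 to a5 are the actions at t2.

| Cases | i1 | i2 | i3 | i4 | i5 | i6 | a1 | a2 | a3 | a4 | a5 | Rewards at t3* |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 127 | 3 | 1 | 0 | 1 | 2 | 1 | 0 | 1 | 0 | 1 | 1 | 1.000 |
| 119 | 3 | 1 | 0 | 1 | 2 | 2 | 0 | 0 | 0 | 0 | 1 | 4.000 |
| 60 | 3 | 1 | 0 | 1 | 2 | 3 | 1 | 0 | 0 | 0 | 1 | 4.667 |
| 53 | 2 | 1 | 0 | 1 | 2 | 1 | 1 | 0 | 0 | 0 | 1 | 1.000 |
| 51 | 3 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1.000 |
| 42 | 3 | 0 | 0 | 1 | 2 | 1 | 1 | 1 | 0 | 0 | 1 | 0.167 |
| 41 | 3 | 1 | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 1 | 0 | 5.000 |
| 39 | 3 | 1 | 0 | 1 | 1 | 3 | 0 | 1 | 0 | 0 | 1 | 6.000 |
| 38 | 3 | 0 | 0 | 1 | 2 | 3 | 1 | 0 | 0 | 1 | 1 | 9.000 |
| 35 | 2 | 1 | 0 | 1 | 2 | 2 | 1 | 0 | 0 | 0 | 1 | 4.000 |
| 31 | 2 | 1 | 0 | 1 | 1 | 2 | 0 | 0 | 1 | 1 | 1 | 2.000 |
| 30 | 3 | 0 | 0 | 1 | 2 | 2 | 1 | 1 | 0 | 1 | 1 | 1.333 |
| 29 | 2 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0.500 |
| 26 | 2 | 1 | 0 | 1 | 2 | 3 | 1 | 1 | 0 | 0 | 1 | 2.571 |
| 23 | 2 | 1 | 0 | 1 | 1 | 3 | 1 | 0 | 0 | 0 | 1 | 7.000 |
| 22 | 2 | 0 | 0 | 1 | 2 | 1 | 1 | 1 | 0 | 0 | 1 | 0.667 |
| 19 | 2 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1.000 |
| 19 | 2 | 0 | 0 | 1 | 2 | 2 | 1 | 1 | 0 | 1 | 1 | 2.000 |
| 19 | 3 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 3.000 |
| 19 | 3 | 0 | 0 | 1 | 1 | 2 | 1 | 1 | 1 | 0 | 1 | 2.500 |
| 18 | 2 | 0 | 0 | 1 | 2 | 3 | 1 | 1 | 0 | 0 | 1 | 1.000 |
| 18 | 3 | 1 | 0 | 1 | 3 | 1 | 1 | 0 | 0 | 0 | 1 | 2.636 |
| 17 | 2 | 0 | 0 | 1 | 1 | 2 | 1 | 0 | 0 | 0 | 1 | 5.000 |
| 12 | 3 | 1 | 1 | 1 | 2 | 3 | 0 | 1 | 0 | 0 | 1 | 3.000 |
| 11 | 3 | 1 | 0 | 1 | 3 | 2 | 1 | 1 | 0 | 0 | 1 | 2.000 |
| 11 | 3 | 1 | 0 | 1 | 4 | 1 | 1 | 0 | 1 | 1 | 1 | 0.500 |
| 9 | 2 | 1 | 0 | 1 | 3 | 1 | 1 | 0 | 0 | 1 | 1 | 2.000 |
| 9 | 3 | 0 | 0 | 1 | 1 | 3 | 1 | 0 | 1 | 1 | 1 | 2.000 |
| 7 | 2 | 1 | 0 | 1 | 4 | 2 | 1 | 0 | 1 | 1 | 1 | 0.333 |
| 6 | 3 | 1 | 1 | 1 | 2 | 2 | 1 | 0 | 1 | 0 | 1 | 0.000 |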

  1. *At timepoint 2 (t2), actions are applied to the states, and the rewards are obtained at t3. (The same applies in what follows.)
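
One way to read the table is as a lookup from a state at t2 to the action combination listed for it and the reward obtained at t3. The sketch below illustrates that reading with three rows copied from the table. The names `POLICY` and `lookup` are hypothetical, and the state and action components are treated as opaque integers because their clinical meaning is not defined in this table.

```python
from typing import Dict, Optional, Tuple

State = Tuple[int, int, int, int, int, int]  # (i1, ..., i6) at t2
Action = Tuple[int, int, int, int, int]      # (a1, ..., a5) at t2

# Three rows copied from the table (illustrative subset, not the full table):
# state -> (listed action combination, reward at t3, number of cases)
POLICY: Dict[State, Tuple[Action, float, int]] = {
    (3, 1, 0, 1, 2, 1): ((0, 1, 0, 1, 1), 1.000, 127),
    (3, 1, 0, 1, 2, 2): ((0, 0, 0, 0, 1), 4.000, 119),
    (3, 0, 0, 1, 2, 3): ((1, 0, 0, 1, 1), 9.000, 38),
}


def lookup(state: State) -> Optional[Tuple[Action, float, int]]:
    """Return (action, reward, cases) for a tabulated state, else None."""
    return POLICY.get(state)


if __name__ == "__main__":
    print(lookup((3, 1, 0, 1, 2, 2)))
```

A state that does not appear in the table returns `None` rather than a guessed action, since the table lists only the states shown.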