# Theory of Scheduling

**10/6/11**

In the work we do we create and read schedules that are used for our many projects.

In our daily work we tend to look at scheduling as using Microsoft Project. In our Project Management classes, we focused on the fundamental ideas of the critical path method (CPM) and the Program Review and Evaluation Technique (PERT), two network methods that were developed independently in the 1950's. In this blog I'll talk a little more about the mathematical theory of PERT.

By a network method, it is meant that the logical relationships between project activities are displayed in a graph. PERT recognizes that activity durations are random variables. Let

p_{j} = duration of activity j

m_{j} = mean of p_{j}, (assumed to be known)

s_{j}^{2} = variance of p_{j} (assumed to be known)

The PERT model assumes that the activities in the network are statistically independent and that the critical path (CP) in the network contains a "large" number of activities. The latter assumption is so we can apply the central limit theorem (CLT).

Let D_{k} denote the sum of the durations of activities along path k in the network. If there are many activities on the path, then D_{k}
follows a normal random variable with mean m_{k} = sum(m_{j}), for all j in k and variance s_{k}^{2} = sum(s_{j}^{2}), for all j in k.

By the CLT, the probability that the project will be completed by due date t is therefore P(D_{CP} <= t) = f[ (t-m_{CP})/s_{CP} ], where f is the
cumulative distribution function for a standard normal random variable.

How do we know m_{j} and s_{j}^{2} ? Often there are no similar activities in similar projects to estimate these from. A rule of
thumb that has shown success in practice is

O = an optimistic duration for an activity

P = a pessimistic duration for an activity

M = the most likely duration for an activity

These three parameters are incorporated in a beta distribution as a probabilistic model for the duration of the activity. The parameters O and P are the minimum and maximum of the activity duration distribution and M is its mode. The original PERT team recommended

m_{j} = (O+4M+P)/6 (note that this is a weighted average, weighting the most likely duration more heavily)

s_{j} = (P-O)/6

There have been many refinements to PERT-type methods over the years that address its simplistic (although very useful and practical) assumptions. The take-away message is that many events are unpredictable and sometimes questions about a project can only be answered in probabilistic terms. For complex projects, with several thousand line items in the schedule, there is no other way.

If you enjoyed *any* of my content, please consider supporting it in a variety of ways:

- Email news@statisticool.com to sign up to receive news and updates
- Donate any amount via PayPal
- Take my Five Poem Challenge
- Buy ad space on Statisticool.com
- Buy what you need on Amazon using my affiliate link
- Follow me on Twitter here