Previously we treated all of the load as originating from a single cpu. That's
not true when an interrupt is assigned to a higher-level object, such as a
cache domain. This change spreads the load out using a heuristic: we assume
that each cpu handles a fair share of the interrupts assigned to its parent
objects, so we attribute only a fraction of the total shared load to each
individual cpu. This gives us a reasonable load value for interrupts assigned
to cache domains, packages, nodes, etc.
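
As a rough illustration of the heuristic, here is a minimal sketch in C. It is
not the code from this change: struct topo_obj, its fields, and
cpu_effective_load() are hypothetical names, and an equal split of each
parent's load across the cpus beneath it is an assumption about what "fair
share" means.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical topology object: a cpu, cache domain, package, or node. */
struct topo_obj {
	struct topo_obj *parent;  /* NULL at the top of the tree */
	uint64_t irq_load;        /* load from irqs assigned directly to this object */
	unsigned int nr_cpus;     /* cpus underneath this object (1 for a cpu) */
};

/*
 * Load attributed to one cpu: its own load plus an equal share of the
 * load assigned to each of its ancestors (assumed split, see above).
 */
static uint64_t cpu_effective_load(const struct topo_obj *cpu)
{
	uint64_t load = cpu->irq_load;
	const struct topo_obj *p;

	for (p = cpu->parent; p != NULL; p = p->parent) {
		if (p->nr_cpus > 0)  /* guard against division by zero */
			load += p->irq_load / p->nr_cpus;
	}
	return load;
}
```

Under these assumptions, a cpu with a load of 10 that sits under a 2-cpu cache
domain (load 100), a 4-cpu package (load 400), and an 8-cpu node (load 800) is
attributed 10 + 50 + 100 + 100 = 260, rather than the full 1310.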

Signed-off-by: Neil Horman <nhorman@tuxdriver.com>