irqbalance has been broken for a long time. Its ability to properly detect msi irqs and to correctly identify interrupt types (net vs. storage vs. other, etc), has been based on some tenuous string comparison logic that was easily broken by administrative name changes for interfaces. I've recently submitted this patch: https://lkml.org/lkml/2011/9/19/176 Which lets us use sysfs exclusively for finding device interrupts, which in turns lets us definitavely identify irq types (legacy pci vs. msi), as well as properly classifying them using the pci device class value. Additionally, this patch rips out the code that attemtps to bias interrupt count volumes using network statistics, since theres no sane way to be certain a single network interrupt is responsible for the number of packets received on a given interface. Workload computation is now done on soley on irq count. This may change in the future, adding /proc/stat irq and softirq time to the biasing mechanism. Note that without the above kernel change, this doesn't work right. Irqbalance contains a self check in which it identifies MSI interrupts in /proc/interrupts still. If it sees MSI irqs in /proc/interrupts, but none in sysfs, then it will issue a loud warning about irqs being missclassified until the kernel is updated. Signed-off-by: Neil Horman <nhorman@tuxdriver.com>
57 lines
1.5 KiB
C
57 lines
1.5 KiB
C
/*
|
|
* Copyright (C) 2006, Intel Corporation
|
|
*
|
|
* This file is part of irqbalance
|
|
*
|
|
* This program file is free software; you can redistribute it and/or modify it
|
|
* under the terms of the GNU General Public License as published by the
|
|
* Free Software Foundation; version 2 of the License.
|
|
*
|
|
* This program is distributed in the hope that it will be useful, but WITHOUT
|
|
* ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
|
|
* FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
|
|
* for more details.
|
|
*
|
|
* You should have received a copy of the GNU General Public License
|
|
* along with this program in a file named COPYING; if not, write to the
|
|
* Free Software Foundation, Inc.,
|
|
* 51 Franklin Street, Fifth Floor,
|
|
* Boston, MA 02110-1301 USA
|
|
*/
|
|
|
|
/*
|
|
* This file tries to map numa affinity of pci devices to their interrupts
|
|
* In addition the PCI class information is used to refine the classification
|
|
* of interrupt sources
|
|
*/
|
|
#include "config.h"
|
|
#include <unistd.h>
|
|
#include <stdlib.h>
|
|
#include <stdio.h>
|
|
#include <sys/types.h>
|
|
#include <dirent.h>
|
|
|
|
#include "irqbalance.h"
|
|
|
|
void pci_numa_scan(void)
|
|
{
|
|
int irq = -1;
|
|
cpumask_t mask;
|
|
int node_num;
|
|
do {
|
|
int type;
|
|
irq = get_next_irq(irq);
|
|
if (irq == -1)
|
|
break;
|
|
|
|
mask = find_irq_cpumask_prop(irq, IRQ_LCPU_MASK);
|
|
|
|
node_num = find_irq_integer_prop(irq, IRQ_NUMA);
|
|
|
|
type = find_irq_integer_prop(irq, IRQ_CLASS);
|
|
|
|
add_interrupt_numa(irq, mask, node_num, type);
|
|
|
|
} while (irq != -1);
|
|
}
|