Centos network dropped ...

voytek

Verified User
Joined
Apr 26, 2010
Messages
22
I have a problem with network on centos, after i restart the server it works for few hours, and the network is dropped. - no errors in the log, only that "Network is unreachable"
I do 'service network restart' comes back on- directadmin, dns, awbs- everything works good, but only for few hours(never longer then 1 day), and then it is dropped again.
I searched centos forums, and on one them someone wrote that it is because that static IP was used on the same network by other machine. I had a different server running with that ip on my network before, but it was few weeks ago, and there is no other server connected to the network right now (I have one desktop connected with dynamic ip, and it has no problem), and i still have the same problem. They suggested that there is something wrong with directadmin, or its configuration.
Somebody said that my old server ( the one that had same ip) is still somewhere in the DA cache - Is that right??? if yes- how to clear it?
I was suggested to set a cron jobs to restart network every few hours- i thing that is not a solution.
Does anyone have any idea what could be a problem? Anyone had similar problems?
Could there be any other reasons for the network to be dropped after few hours?
Thank You
 
I had a similar problem at work, nothing to do with DirectAdmin.

I was seeing the network disappear. I originally had a 'forcedeth' based card and it would just stop working until I restarted the network. I read a few reports of others having the same problem, so I threw in a 'tulip' based card since that driver was much older and better supported.

It still happened a few more times, although less often. One of the things I noticed, was that we had this box connected to a really old ethernet hub/switch and the best connection I could get with this card was 10/half-duplex. I upgraded the switch to a 100mbs switch and so far we haven't lost the network since (been over two weeks now).


What card do you have and what is the connection speed?
 
'Broadcom NetXtreme Gigabit' - the one integrated with asus board
(it is a new board) - and i updating drivers.
- one more thing before loading centos on that system - i had windows server 2008 r2 on it for about 2 weeks - and i had no problem.

Thanks
 
I really think it is a Linux thing, and not really something to do with DirectAdmin or even the network card itself. I was poking around some of the code (I don't remember if it was the driver code or some code higher in the stack). I saw some code that would disable the network card when an error occurred, in my case, it looked like it was the I/O buffer being overrun. I didn't see anything in a log file, but a dmesg did show a problem with the network card, and that is where I found that source code.
 
I still have no luck getting this server to run for more than few couple/days.
Here is latest 'dmesg' output after losing network:


[root@server ~]# dmesg
Linux version 2.6.18-194.8.1.el5 ([email protected]) (gcc version 4.1.2 20080704 (Red Hat 4.1.2-48)) #1 SMP Thu Jul 1 19:04:48 EDT 2010
Command line: ro root=LABEL=/ rhgb quiet
BIOS-provided physical RAM map:
BIOS-e820: 0000000000010000 - 000000000009fc00 (usable)
BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 000000007fff0000 (usable)
BIOS-e820: 000000007fff0000 - 000000007ffff000 (ACPI data)
BIOS-e820: 000000007ffff000 - 0000000080000000 (ACPI NVS)
BIOS-e820: 00000000fffc0000 - 0000000100000000 (reserved)
DMI 2.3 present.
ACPI: RSDP (v000 ACPIAM ) @ 0x00000000000f5210
ACPI: RSDT (v001 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007fff0000
ACPI: FADT (v002 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007fff0200
ACPI: WAET (v001 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007fff0b00
ACPI: SLIC (v001 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007fff0b40
ACPI: OEM0 (v001 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007fff0d40
ACPI: SRAT (v002 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007fff0600
ACPI: MADT (v001 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007fff0300
ACPI: OEMB (v001 VRTUAL MICROSFT 0x03000919 MSFT 0x00000097) @ 0x000000007ffff240
ACPI: DSDT (v001 MSFTVM MSFTVM02 0x00000002 INTL 0x02002026) @ 0x0000000000000000
SRAT: PXM 1 -> APIC 0 -> Node 0
SRAT: PXM 1 -> APIC 1 -> Node 0
SRAT: Node 0 PXM 1 0-80000000
SRAT: Node 0 PXM 1 0-f8000000
SRAT: hot plug zone found 80000000 - f8000000
SRAT: Hotplug region ignored
SRAT: Node 0 PXM 1 0-300000000
SRAT: Hotplug zone not continuous. Partly ignored
SRAT: Hotplug region ignored
NUMA: Using 63 for the hash shift.
Bootmem setup node 0 0000000000000000-000000007fff0000
Memory for crash kernel (0x0 to 0x0) notwithin permissible range
disabling kdump
On node 0 totalpages: 515694
DMA zone: 2629 pages, LIFO batch:0
DMA32 zone: 513065 pages, LIFO batch:31
ACPI: PM-Timer IO Port: 0x408
ACPI: Local APIC address 0xfee00000
ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
Processor #0 7:7 APIC version 20
ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
Processor #1 7:7 APIC version 20
ACPI: LAPIC (acpi_id[0x03] lapic_id[0x02] disabled)
ACPI: LAPIC (acpi_id[0x04] lapic_id[0x03] disabled)
ACPI: LAPIC (acpi_id[0x05] lapic_id[0x04] disabled)
ACPI: LAPIC (acpi_id[0x06] lapic_id[0x05] disabled)
ACPI: LAPIC (acpi_id[0x07] lapic_id[0x06] disabled)
ACPI: LAPIC (acpi_id[0x08] lapic_id[0x07] disabled)
ACPI: LAPIC (acpi_id[0x09] lapic_id[0x08] disabled)
ACPI: LAPIC (acpi_id[0x0a] lapic_id[0x09] disabled)
ACPI: LAPIC (acpi_id[0x0b] lapic_id[0x0a] disabled)
ACPI: LAPIC (acpi_id[0x0c] lapic_id[0x0b] disabled)
ACPI: LAPIC (acpi_id[0x0d] lapic_id[0x0c] disabled)
ACPI: LAPIC (acpi_id[0x0e] lapic_id[0x0d] disabled)
ACPI: LAPIC (acpi_id[0x0f] lapic_id[0x0e] disabled)
ACPI: LAPIC (acpi_id[0x10] lapic_id[0x0f] disabled)
ACPI: LAPIC (acpi_id[0x11] lapic_id[0x10] disabled)
ACPI: LAPIC (acpi_id[0x12] lapic_id[0x11] disabled)
ACPI: LAPIC (acpi_id[0x13] lapic_id[0x12] disabled)
ACPI: LAPIC (acpi_id[0x14] lapic_id[0x13] disabled)
ACPI: LAPIC (acpi_id[0x15] lapic_id[0x14] disabled)
ACPI: LAPIC (acpi_id[0x16] lapic_id[0x15] disabled)
ACPI: LAPIC (acpi_id[0x17] lapic_id[0x16] disabled)
ACPI: LAPIC (acpi_id[0x18] lapic_id[0x17] disabled)
ACPI: LAPIC (acpi_id[0x19] lapic_id[0x18] disabled)
ACPI: LAPIC (acpi_id[0x1a] lapic_id[0x19] disabled)
ACPI: LAPIC (acpi_id[0x1b] lapic_id[0x1a] disabled)
ACPI: LAPIC (acpi_id[0x1c] lapic_id[0x1b] disabled)
ACPI: LAPIC (acpi_id[0x1d] lapic_id[0x1c] disabled)
ACPI: LAPIC (acpi_id[0x1e] lapic_id[0x1d] disabled)
ACPI: LAPIC (acpi_id[0x1f] lapic_id[0x1e] disabled)
ACPI: LAPIC (acpi_id[0x20] lapic_id[0x1f] disabled)
ACPI: LAPIC (acpi_id[0x21] lapic_id[0x20] disabled)
ACPI: LAPIC (acpi_id[0x22] lapic_id[0x21] disabled)
ACPI: LAPIC (acpi_id[0x23] lapic_id[0x22] disabled)
ACPI: LAPIC (acpi_id[0x24] lapic_id[0x23] disabled)
ACPI: LAPIC (acpi_id[0x25] lapic_id[0x24] disabled)
ACPI: LAPIC (acpi_id[0x26] lapic_id[0x25] disabled)
ACPI: LAPIC (acpi_id[0x27] lapic_id[0x26] disabled)
ACPI: LAPIC (acpi_id[0x28] lapic_id[0x27] disabled)
ACPI: LAPIC (acpi_id[0x29] lapic_id[0x28] disabled)
ACPI: LAPIC (acpi_id[0x2a] lapic_id[0x29] disabled)
ACPI: LAPIC (acpi_id[0x2b] lapic_id[0x2a] disabled)
ACPI: LAPIC (acpi_id[0x2c] lapic_id[0x2b] disabled)
ACPI: LAPIC (acpi_id[0x2d] lapic_id[0x2c] disabled)
ACPI: LAPIC (acpi_id[0x2e] lapic_id[0x2d] disabled)
ACPI: LAPIC (acpi_id[0x2f] lapic_id[0x2e] disabled)
ACPI: LAPIC (acpi_id[0x30] lapic_id[0x2f] disabled)
ACPI: LAPIC (acpi_id[0x31] lapic_id[0x30] disabled)
ACPI: LAPIC (acpi_id[0x32] lapic_id[0x31] disabled)
ACPI: LAPIC (acpi_id[0x33] lapic_id[0x32] disabled)
ACPI: LAPIC (acpi_id[0x34] lapic_id[0x33] disabled)
ACPI: LAPIC (acpi_id[0x35] lapic_id[0x34] disabled)
ACPI: LAPIC (acpi_id[0x36] lapic_id[0x35] disabled)
ACPI: LAPIC (acpi_id[0x37] lapic_id[0x36] disabled)
ACPI: LAPIC (acpi_id[0x38] lapic_id[0x37] disabled)
ACPI: LAPIC (acpi_id[0x39] lapic_id[0x38] disabled)
ACPI: LAPIC (acpi_id[0x3a] lapic_id[0x39] disabled)
ACPI: LAPIC (acpi_id[0x3b] lapic_id[0x3a] disabled)
ACPI: LAPIC (acpi_id[0x3c] lapic_id[0x3b] disabled)
ACPI: LAPIC (acpi_id[0x3d] lapic_id[0x3c] disabled)
ACPI: LAPIC (acpi_id[0x3e] lapic_id[0x3d] disabled)
ACPI: LAPIC (acpi_id[0x3f] lapic_id[0x3e] disabled)
ACPI: IOAPIC (id[0x00] address[0xfec00000] gsi_base[0])
IOAPIC[0]: apic_id 0, version 17, address 0xfec00000, GSI 0-23
ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
ACPI: IRQ0 used by override.
ACPI: IRQ2 used by override.
ACPI: IRQ9 used by override.
Setting APIC routing to physical flat
Using ACPI (MADT) for SMP configuration information
Nosave address range: 000000000009f000 - 00000000000a0000
Nosave address range: 00000000000a0000 - 00000000000e0000
Nosave address range: 00000000000e0000 - 0000000000100000
Allocating PCI resources starting at 88000000 (gap: 80000000:7ffc0000)
SMP: Allowing 63 CPUs, 61 hotplug CPUs
Built 1 zonelists. Total pages: 515694
Kernel command line: ro root=LABEL=/ rhgb quiet
Initializing CPU#0
PID hash table entries: 4096 (order: 12, 32768 bytes)
Console: colour VGA+ 80x25
Dentry cache hash table entries: 262144 (order: 9, 2097152 bytes)
Inode-cache hash table entries: 131072 (order: 8, 1048576 bytes)
Checking aperture...
ACPI: DMAR not present
Memory: 2054660k/2097088k available (2575k kernel code, 41976k reserved, 1303k data, 212k init)
Calibrating delay loop (skipped), value calculated using timer frequency.. 4543.21 BogoMIPS (lpj=2271609)
Security Framework v1.0.0 initialized
SELinux: Initializing.
SELinux: Starting in permissive mode
selinux_register_security: Registering secondary module capability
Capability LSM initialized as secondary
Mount-cache hash table entries: 256
CPU: L1 I cache: 32K, L1 D cache: 32K
CPU: L2 cache: 6144K
CPU 0/0 -> Node 0
CPU: Physical Processor ID: 0
CPU: Processor Core ID: 0
SMP alternatives: switching to UP code
ACPI: Core revision 20060707
activating NMI Watchdog ... done.
Using local APIC timer interrupts.
WARNING calibrate_APIC_clock: the APIC timer calibration may be wrong.
Detected 11.986 MHz APIC timer.
SMP alternatives: switching to SMP code
Booting processor 1/2 APIC 0x1
Initializing CPU#1
Calibrating delay using timer specific routine.. 4506.46 BogoMIPS (lpj=2253234)
CPU: L1 I cache: 32K, L1 D cache: 32K
CPU: L2 cache: 6144K
CPU 1/1 -> Node 0
CPU: Physical Processor ID: 0
CPU: Processor Core ID: 1
Intel(R) Xeon(R) CPU E5410 @ 2.33GHz stepping 0a
Brought up 2 CPUs
testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears to be stuck (0->0)!
time.c: Using 3.579545 MHz WALL PM GTOD PIT/TSC timer.
time.c: Detected 2271.609 MHz processor.
sizeof(vma)=176 bytes
sizeof(page)=56 bytes
sizeof(inode)=560 bytes
sizeof(dentry)=216 bytes
sizeof(ext3inode)=760 bytes
sizeof(buffer_head)=96 bytes
sizeof(skbuff)=248 bytes
migration_cost=6
checking if image is initramfs... it is
Freeing initrd memory: 2595k freed
NET: Registered protocol family 16
ACPI: bus type pci registered
PCI: Using configuration type 1
ACPI: Interpreter enabled
ACPI: Using IOAPIC for interrupt routing
ACPI: No dock devices found.
ACPI: PCI Root Bridge [PCI0] (0000:00)
PCI quirk: region 0400-043f claimed by PIIX4 ACPI
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 7 9 10 *11 12 14 15)
ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 5 7 9 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 7 9 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 7 9 10 11 12 14 15) *0, disabled.
Linux Plug and Play Support v0.97 (c) Adam Belay
pnp: PnP ACPI init
pnp: PnP ACPI: found 13 devices
usbcore: registered new driver usbfs
usbcore: registered new driver hub
PCI: Using ACPI for IRQ routing
PCI: If a device doesn't work, try "pci=routeirq". If it helps, post a report
NetLabel: Initializing
NetLabel: domain hash size = 128
NetLabel: protocols = UNLABELED CIPSOv4
NetLabel: unlabeled traffic allowed by default
ACPI: DMAR not present
PCI-GART: No AMD northbridge found.
pnp: 00:0b: ioport range 0x400-0x43f could not be reserved
pnp: 00:0b: ioport range 0x370-0x371 has been reserved
pnp: 00:0b: ioport range 0x440-0x44f has been reserved
NET: Registered protocol family 2
IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
TCP: Hash tables configured (established 262144 bind 65536)
TCP reno registered
audit: initializing netlink socket (disabled)
type=2000 audit(1280177812.599:1): initialized
Total HugeTLB memory allocated, 0
VFS: Disk quotas dquot_6.5.1
Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
SELinux: Registering netfilter hooks
Initializing Cryptographic API
alg: No test for crc32c (crc32c-generic)
ksign: Installing public key data
Loading keyring
- Added public key 71959A475B93578
- User ID: CentOS (Kernel Module GPG key)
io scheduler noop registered
io scheduler anticipatory registered
io scheduler deadline registered
io scheduler cfq registered (default)
Limiting direct PCI/PCI transfers.
Boot video device is 0000:00:08.0
pci_hotplug: PCI Hot Plug PCI Core version: 0.5
Real Time Clock Driver v1.12ac
Non-volatile memory driver v1.2
Linux agpgart interface v0.101 (c) Dave Jones
Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled
serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
00:07: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
00:08: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
brd: module loaded
Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller at PCI slot 0000:00:07.1
PIIX4: chipset revision 1
PIIX4: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xffa0-0xffa7, BIOS settings: hda:DMA, hdb:pio
ide1: BM-DMA at 0xffa8-0xffaf, BIOS settings: hdc:DMA, hdd:pio
Probing IDE interface ide0...
hda: Virtual HD, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Probing IDE interface ide1...
hdc: Virtual CD, ATAPI CD/DVD-ROM drive
ide1 at 0x170-0x177,0x376 on irq 15
hda: max request size: 512KiB
hda: 419430400 sectors (214748 MB) w/64KiB Cache, CHS=26108/255/63, DMA
hda: hda1 hda2 hda3
ide-floppy driver 0.99.newide
usbcore: registered new driver hiddev
usbcore: registered new driver usbhid
drivers/usb/input/hid-core.c: v2.6:USB HID core driver
PNP: PS/2 Controller [PNP0303:pS2K,PNP0f03:pS2M] at 0x60,0x64 irq 1,12
serio: i8042 KBD port at 0x60,0x64 irq 1
serio: i8042 AUX port at 0x60,0x64 irq 12
mice: PS/2 mouse device common for all mice
md: md driver 0.90.3 MAX_MD_DEVS=256, MD_SB_DISKS=27
md: bitmap version 4.39
TCP bic registered
Initializing IPsec netlink socket
NET: Registered protocol family 1
NET: Registered protocol family 17
ACPI: (supports S0 S5)
Initalizing network drop monitor service
Freeing unused kernel memory: 212k freed
Write protecting the kernel read-only data: 504k
input: AT Translated Set 2 keyboard as /class/input/input0
trackpoint.c: failed to get extended button data
ohci_hcd: 2005 April 22 USB 1.1 'Open' Host Controller (OHCI) Driver (PCI)
USB Universal Host Controller Interface driver v3.0
SCSI subsystem initialized
libata version 3.00 loaded.
device-mapper: uevent: version 1.0.3
device-mapper: ioctl: 4.11.5-ioctl (2007-12-12) initialised: [email protected]
device-mapper: dm-raid45: initialized v0.2594l
IBM TrackPoint firmware: 0x01, buttons: 0/0
input: TPPS/2 IBM TrackPoint as /class/input/input1
EXT3-fs: INFO: recovery required on readonly filesystem.
EXT3-fs: write access will be enabled during recovery.
kjournald starting. Commit interval 5 seconds
EXT3-fs: hda2: orphan cleanup on readonly fs
ext3_orphan_cleanup: deleting unreferenced inode 50234615
ext3_orphan_cleanup: deleting unreferenced inode 41058312
ext3_orphan_cleanup: deleting unreferenced inode 41058311
ext3_orphan_cleanup: deleting unreferenced inode 41058310
ext3_orphan_cleanup: deleting unreferenced inode 41058309
ext3_orphan_cleanup: deleting unreferenced inode 41058308
EXT3-fs: hda2: 6 orphan inodes deleted
EXT3-fs: recovery complete.
EXT3-fs: mounted filesystem with ordered data mode.
SELinux: Disabled at runtime.
SELinux: Unregistering netfilter hooks
type=1404 audit(1280177842.204:2): selinux=0 auid=4294967295 ses=4294967295
Linux Tulip driver version 1.1.13 (May 11, 2002)
ACPI: PCI Interrupt Link [LNKA] enabled at IRQ 9
ACPI: PCI Interrupt 0000:00:0a.0[A] -> Link [LNKA] -> GSI 9 (level, low) -> IRQ 9
tulip0: EEPROM default media type 100baseTx-FDX.
tulip0: Index #0 - Media 100baseTx (#3) described by a 21140 non-MII (0) block.
tulip0: Index #1 - Media 100baseTx-FDX (#5) described by a 21140 non-MII (0) block.
eth0: Digital DS21140 Tulip rev 32 at ffffc2000001e000, 00:15:5D:01:66:1C, IRQ 9.
input: PC Speaker as /class/input/input2
Floppy drive(s): fd0 is 1.44M
FDC 0 is an 82078.
hdc: ATAPI DVD-ROM drive, 0kB Cache, DMA
Uniform CD-ROM driver Revision: 3.20
piix4_smbus 0000:00:07.3: Found 0000:00:07.3 device
piix4_smbus 0000:00:07.3: SMB base address uninitialized - upgrade BIOS or use force_addr=0xaddr
lp: driver loaded but no devices found
NET: Registered protocol family 10
lo: Disabled Privacy Extensions
IPv6 over IPv4 tunneling driver
mtrr: type mismatch for f8000000,400000 old: write-back new: write-combining
ACPI: Power Button (FF) [PWRF]
ACPI: Mapper loaded
dell-wmi: No known WMI GUID found
md: Autodetecting RAID arrays.
md: autorun ...
md: ... autorun DONE.
device-mapper: multipath: version 1.0.5 loaded
EXT3 FS on hda2, internal journal
kjournald starting. Commit interval 5 seconds
EXT3 FS on hda1, internal journal
EXT3-fs: mounted filesystem with ordered data mode.
Adding 6289436k swap on /dev/hda3. Priority:-1 extents:1 across:6289436k
IA-32 Microcode Update Driver: v1.14a <[email protected]>
ip6_tables: (C) 2000-2006 Netfilter Core Team
eth0: Using EEPROM-set media 100baseTx-FDX.
Bluetooth: Core ver 2.10
NET: Registered protocol family 31
Bluetooth: HCI device and connection manager initialized
Bluetooth: HCI socket layer initialized
Bluetooth: L2CAP ver 2.8
Bluetooth: L2CAP socket layer initialized
Bluetooth: RFCOMM socket layer initialized
Bluetooth: RFCOMM TTY layer initialized
Bluetooth: RFCOMM ver 1.8
eth0: no IPv6 routers present
Bluetooth: HIDP (Human Interface Emulation) ver 1.1
mtrr: type mismatch for f8000000,400000 old: write-back new: write-combining
atkbd.c: Unknown key pressed (translated set 2, code 0x0 on isa0060/serio0).
atkbd.c: Use 'setkeycodes 00 <keycode>' to make it known.
atkbd.c: Unknown key released (translated set 2, code 0x0 on isa0060/serio0).
atkbd.c: Use 'setkeycodes 00 <keycode>' to make it known.
[root@server ~]#




Sorry for that LONG list - I just wasn't sure what will be important to show.
Any idea what is wrong with my setup?
Thank You
 
Back
Top