Server powered on but unreachable

virtualization
v7
hardware

(Roberto) #1

NethServer Version: Final 7.3
Module: System
Hi everyone,
It has happened twice in one day.
I find the server powered on but not reachable via web or console.
I have to restart it, and then everything works normally again. I suspect a hardware problem.
Is there a log I can consult to find the problem?
Thank you


(James Nesbitt) #2

For a potential hardware failure, the first place to look is /var/log/dmesg, and after that /var/log/messages.
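A minimal Python sketch of such a first pass over a syslog-style file. The keyword list and the function name `find_suspect_lines` are assumptions made for illustration, not part of NethServer or of any tool. It only looks at lines logged by the kernel and flags those containing a few words that often accompany hardware or driver trouble.

```python
import re

# Assumed keyword list (not from this thread): words that often show up in
# kernel messages about hardware or driver trouble. Extend it as needed.
SUSPECT = re.compile(
    r"\b(error|fail(ed|ure)?|panic|oops|lockup|machine check|mce)\b",
    re.IGNORECASE,
)


def find_suspect_lines(lines):
    """Return kernel log lines that contain one of the suspect keywords."""
    hits = []
    for line in lines:
        # Classic syslog format: "<timestamp> <host> kernel: <message>"
        if " kernel: " in line and SUSPECT.search(line):
            hits.append(line.rstrip("\n"))
    return hits
```

To try it on a real file, something like `find_suspect_lines(open("/var/log/messages", errors="replace"))` should work when run as root. A match is only a hint about where to read more closely, not proof of a hardware fault.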


(Roberto) #3

Hi,
I didn’t find /var/log/dmesg, but in /var/log/messages I found these entries:

FIRST TIME

Oct 24 19:50:03 neth clamd: SelfCheck: Database status OK.
Oct 24 19:50:09 neth nmbd[1688]: [2017/10/24 19:50:09.753186, 0] …/source3/libsmb/nmblib.c:873(send_udp)
Oct 24 19:50:09 neth nmbd[1688]: Packet send failed to 192.168.122.255(138) ERRNO=Operation not permitted
Oct 24 19:51:59 neth nmbd[1688]: [2017/10/24 19:51:59.996491, 0] …/source3/libsmb/nmblib.c:873(send_udp)
Oct 24 19:51:59 neth nmbd[1688]: Packet send failed to 192.168.122.255(137) ERRNO=Operation not permitted
Oct 24 19:51:59 neth nmbd[1688]: [2017/10/24 19:51:59.996599, 0] …/source3/nmbd/nmbd_packets.c:179(send_netbios_packet)
Oct 24 19:51:59 neth nmbd[1688]: send_netbios_packet: send_packet() to IP 192.168.122.255 port 137 failed
Oct 24 19:51:59 neth nmbd[1688]: [2017/10/24 19:51:59.996634, 0] …/source3/nmbd/nmbd_namequery.c:245(query_name)
Oct 24 19:51:59 neth nmbd[1688]: query_name: Failed to send packet trying to query name WORKGROUP<1d>
Oct 24 19:52:32 neth kernel: [drm:gen9_set_dc_state [i915]] ERROR DC state mismatch (0x0 -> 0x2)

Oct 25 09:03:19 neth journal: Runtime journal is using 8.0M (max allowed 384.4M, trying to leave 576.7M free of 3.7G available → current limit 384.4M).

Manual server start

Oct 25 09:03:19 neth kernel: microcode: microcode updated early to revision 0xba, date = 2017-04-09
Oct 25 09:03:19 neth kernel: Initializing cgroup subsys cpuset
Oct 25 09:03:19 neth kernel: Initializing cgroup subsys cpu
Oct 25 09:03:19 neth kernel: Initializing cgroup subsys cpuacct
Oct 25 09:03:19 neth kernel: Linux version 3.10.0-693.2.2.el7.x86_64 (builder@kbuilder.dev.centos.org) (gcc version 4.8.5 20150623 (Red Hat 4.8.5-16) (GCC) ) #1 SMP Tue Sep 12 22:26:13 UTC 2017


SECOND TIME

Oct 25 19:30:02 neth clamd: SelfCheck: Database status OK.
Oct 25 19:32:01 neth nmbd[1530]: [2017/10/25 19:32:01.412655, 0] …/source3/libsmb/nmblib.c:873(send_udp)
Oct 25 19:32:01 neth nmbd[1530]: Packet send failed to 192.168.122.255(137) ERRNO=Operation not permitted
Oct 25 19:32:01 neth nmbd[1530]: [2017/10/25 19:32:01.412775, 0] …/source3/nmbd/nmbd_packets.c:179(send_netbios_packet)
Oct 25 19:32:01 neth nmbd[1530]: send_netbios_packet: send_packet() to IP 192.168.122.255 port 137 failed
Oct 25 19:32:01 neth nmbd[1530]: [2017/10/25 19:32:01.412812, 0] …/source3/nmbd/nmbd_namequery.c:245(query_name)
Oct 25 19:32:01 neth nmbd[1530]: query_name: Failed to send packet trying to query name WORKGROUP<1d>

Manual server start

Oct 25 19:38:30 neth kernel: microcode: microcode updated early to revision 0xba, date = 2017-04-09
Oct 25 19:38:30 neth kernel: Initializing cgroup subsys cpuset
Oct 25 19:38:30 neth kernel: Initializing cgroup subsys cpu
Oct 25 19:38:30 neth kernel: Initializing cgroup subsys cpuacct
Oct 25 19:38:30 neth kernel: Linux version 3.10.0-693.2.2.el7.x86_64 (builder@kbuilder.dev.centos.org) (gcc version 4.8.5 20150623 (Red Hat 4.8.5-16) (GCC) ) #1 SMP Tue Sep 12 22:26:13 UTC 2017


(Roberto) #4

I also found /var/log/dmesg.
How do I read this log, and what do I have to look for?


(Roberto) #5

Hi,
It happened again, and I had to force a manual restart of the server.
Does anybody know how to help me?
Thank you


(Marc) #6

When it happens do you still have physical access to a working console?


(Markus Neuberger) #7

You should look for hardware errors.

https://www.tecmint.com/dmesg-commands/

The basic problem with log files and hardware errors: logging may already be dead by the time the error occurs.
Did you look in your BIOS? Maybe you’ll find something there…

If the system is frozen, maybe there’s an error message on your server screen…
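One way to see when logging stopped is to compare the last entry written before each boot with the boot time itself. The Python sketch below does that for classic syslog lines. The function names, the `year` parameter (syslog timestamps carry no year) and the choice of `kernel: Linux version` as the boot marker are assumptions made for illustration. Month parsing with `%b` also assumes an English/C locale.

```python
from datetime import datetime

# Assumed marker for "a new boot starts here" in /var/log/messages.
BOOT_MARKER = "kernel: Linux version"


def parse_ts(line, year):
    """Parse the leading 'Oct 24 19:52:32' style timestamp of a syslog line."""
    return datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")


def gaps_before_boot(lines, year=2017):
    """Return (last_entry_time, boot_time) for every boot marker found."""
    results = []
    last_ts = None  # timestamp of the most recent line
    prev_ts = None  # most recent timestamp that differs from last_ts
    for line in lines:
        try:
            ts = parse_ts(line, year)
        except ValueError:
            continue  # skip blank lines and anything without a timestamp
        if last_ts is None or ts != last_ts:
            prev_ts, last_ts = last_ts, ts
        if BOOT_MARKER in line and prev_ts is not None:
            results.append((prev_ts, ts))
    return results
```

A long gap between the two times suggests the machine stopped logging well before it was restarted. It does not say why, so it narrows the time window rather than naming a cause.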


(Roberto) #8

No. I have to restart it physically…


(Roberto) #9

There is nothing on the server screen. I will look in the BIOS.

In the messages log, I see that before the reboot the server tries to query IP 192.168.122.255, which is in the subnet of the virtual machines’ default network adapter.
On Saturday I disabled the virtual network adapter, and so far the problem has not reappeared.
Could there be a relationship?
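As a quick check of what kind of address 192.168.122.255 is, here is a small Python sketch using the standard `ipaddress` module. It assumes the virtual "default" network is 192.168.122.0/24, which is the usual libvirt default but is not confirmed for this server. The function name `describe` is made up for illustration.

```python
import ipaddress

# Assumption: the virtual "default" network is 192.168.122.0/24
# (libvirt's usual default; not confirmed in this thread).
DEFAULT_VIRT_NET = ipaddress.ip_network("192.168.122.0/24")


def describe(addr, network=DEFAULT_VIRT_NET):
    """Say how an address relates to the given network."""
    ip = ipaddress.ip_address(addr)
    if ip not in network:
        return "outside network"
    if ip == network.broadcast_address:
        return "broadcast address"
    if ip == network.network_address:
        return "network address"
    return "host address"


print(describe("192.168.122.255"))  # prints "broadcast address"
```

Under that assumption, the nmbd messages are NetBIOS broadcasts sent onto the virtual network, not queries to one specific host.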


(Marc) #10

I suspect the kernel and Intel i915 graphics.


(Michael Träumner) #11

If it still works, I think it’s a problem with the virtual network driver. Can you tell us something about the configuration, please?


(Roberto) #12

Since I disabled the “default” network adapter in virtual machines, it has not happened again.

My configuration:


(Michael Träumner) #13

Can you post from a few lines before to a few lines after the message? I’m trying to find out what is happening here.

The pictures you posted are very small; I can’t read anything. But what I gather is that you installed a virtual machine on NethServer, so it shouldn’t be a driver problem.


(Roberto) #14

Look here…

Oct 27 19:05:01 neth systemd: Starting Session 2683 of user root.
Oct 27 19:05:01 neth systemd: Started Session 2685 of user root.
Oct 27 19:05:01 neth systemd: Starting Session 2685 of user root.
Oct 27 19:05:01 neth systemd: Started Session 2684 of user root.
Oct 27 19:05:01 neth systemd: Starting Session 2684 of user root.
Oct 27 19:07:04 neth nmbd[1797]: [2017/10/27 19:07:04.227737, 0] …/source3/libsmb/nmblib.c:873(send_udp)
Oct 27 19:07:04 neth nmbd[1797]: Packet send failed to 192.168.122.255(137) ERRNO=Operation not permitted
Oct 27 19:07:04 neth nmbd[1797]: [2017/10/27 19:07:04.227883, 0] …/source3/nmbd/nmbd_packets.c:179(send_netbios_packet)
Oct 27 19:07:04 neth nmbd[1797]: send_netbios_packet: send_packet() to IP 192.168.122.255 port 137 failed
Oct 27 19:07:04 neth nmbd[1797]: [2017/10/27 19:07:04.227919, 0] …/source3/nmbd/nmbd_namequery.c:245(query_name)
Oct 27 19:07:04 neth nmbd[1797]: query_name: Failed to send packet trying to query name WORKGROUP<1d>
Oct 27 19:10:01 neth systemd: Started Session 2686 of user root.
Oct 27 19:10:01 neth systemd: Starting Session 2686 of user root.
Oct 27 19:10:01 neth systemd: Started Session 2688 of user root.
Oct 27 19:10:01 neth systemd: Starting Session 2688 of user root.
Oct 27 19:10:01 neth systemd: Started Session 2687 of user root.
Oct 27 19:10:01 neth systemd: Starting Session 2687 of user root.
Oct 27 19:10:01 neth systemd: Started Session 2689 of user root.
Oct 27 19:10:01 neth systemd: Starting Session 2689 of user root.
Oct 27 19:10:02 neth clamd: SelfCheck: Database modification detected. Forcing reload.
Oct 27 19:10:02 neth clamd: Reading databases from /var/lib/clamav
Oct 27 19:10:09 neth clamd: Database correctly reloaded (6473002 signatures)
Oct 27 19:10:36 neth nmbd[1797]: [2017/10/27 19:10:36.554221, 0] …/source3/libsmb/nmblib.c:873(send_udp)
Oct 27 19:10:36 neth nmbd[1797]: Packet send failed to 192.168.122.255(138) ERRNO=Operation not permitted
Oct 27 19:12:04 neth nmbd[1797]: [2017/10/27 19:12:04.229406, 0] …/source3/libsmb/nmblib.c:873(send_udp)
Oct 27 19:12:04 neth nmbd[1797]: Packet send failed to 192.168.122.255(137) ERRNO=Operation not permitted
Oct 27 19:12:04 neth nmbd[1797]: [2017/10/27 19:12:04.229527, 0] …/source3/nmbd/nmbd_packets.c:179(send_netbios_packet)
Oct 27 19:12:04 neth nmbd[1797]: send_netbios_packet: send_packet() to IP 192.168.122.255 port 137 failed
Oct 27 19:12:04 neth nmbd[1797]: [2017/10/27 19:12:04.229563, 0] …/source3/nmbd/nmbd_namequery.c:245(query_name)
Oct 27 19:12:04 neth nmbd[1797]: query_name: Failed to send packet trying to query name WORKGROUP<1d>
Oct 27 19:15:01 neth systemd: Started Session 2691 of user root.
Oct 27 19:15:01 neth systemd: Starting Session 2691 of user root.
Oct 27 19:15:01 neth systemd: Started Session 2692 of user root.
Oct 27 19:15:01 neth systemd: Starting Session 2692 of user root.
Oct 27 19:15:01 neth systemd: Started Session 2690 of user root.
Oct 27 19:15:01 neth systemd: Starting Session 2690 of user root.
Oct 27 19:17:04 neth nmbd[1797]: [2017/10/27 19:17:04.235424, 0] …/source3/libsmb/nmblib.c:873(send_udp)
Oct 27 19:17:04 neth nmbd[1797]: Packet send failed to 192.168.122.255(137) ERRNO=Operation not permitted
Oct 27 19:17:04 neth nmbd[1797]: [2017/10/27 19:17:04.235543, 0] …/source3/nmbd/nmbd_packets.c:179(send_netbios_packet)
Oct 27 19:17:04 neth nmbd[1797]: send_netbios_packet: send_packet() to IP 192.168.122.255 port 137 failed
Oct 27 19:17:04 neth nmbd[1797]: [2017/10/27 19:17:04.235579, 0] …/source3/nmbd/nmbd_namequery.c:245(query_name)
Oct 27 19:17:04 neth nmbd[1797]: query_name: Failed to send packet trying to query name WORKGROUP<1d>

MANUAL RESTART

Oct 28 11:22:13 neth journal: Runtime journal is using 8.0M (max allowed 384.4M, trying to leave 576.7M free of 3.7G available → current limit 384.4M).
Oct 28 11:22:13 neth kernel: microcode: microcode updated early to revision 0xba, date = 2017-04-09
Oct 28 11:22:13 neth kernel: Initializing cgroup subsys cpuset
Oct 28 11:22:13 neth kernel: Initializing cgroup subsys cpu
Oct 28 11:22:13 neth kernel: Initializing cgroup subsys cpuacct
Oct 28 11:22:13 neth kernel: Linux version 3.10.0-693.2.2.el7.x86_64 (builder@kbuilder.dev.centos.org) (gcc version 4.8.5 20150623 (Red Hat 4.8.5-16) (GCC) ) #1 SMP Tue Sep 12 22:26:13 UTC 2017
Oct 28 11:22:13 neth kernel: Command line: BOOT_IMAGE=/vmlinuz-3.10.0-693.2.2.el7.x86_64 root=/dev/mapper/VolGroup-lv_root ro crashkernel=auto rd.md.uuid=95ffe556:a12edfe5:123faff3:92c2e247 rd.lvm.lv=VolGroup/lv_root rd.md.uuid=458a6b9a:b3bd541c:99ef5575:994de455 rd.lvm.lv=VolGroup/lv_swap nodmraid rhgb quiet LANG=en_US.UTF-8
Oct 28 11:22:13 neth kernel: e820: BIOS-provided physical RAM map:
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000000000000-0x00000000000907ff] usable
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000000090800-0x000000000009ffff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x00000000000e0000-0x00000000000fffff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000000100000-0x0000000083f20fff] usable
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000083f21000-0x0000000083f21fff] ACPI NVS
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000083f22000-0x0000000083f4bfff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000083f4c000-0x0000000083fa0fff] usable
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000083fa1000-0x0000000084791fff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000084792000-0x000000008a2adfff] usable
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x000000008a2ae000-0x000000008b8dbfff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x000000008b8dc000-0x000000008b92cfff] ACPI data
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x000000008b92d000-0x000000008bfa3fff] ACPI NVS
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x000000008bfa4000-0x000000008c4fefff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x000000008c4ff000-0x000000008c4fffff] usable
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x000000008c500000-0x000000008c5fffff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x00000000e0000000-0x00000000efffffff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x00000000fe000000-0x00000000fe010fff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x00000000fec00000-0x00000000fec00fff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x00000000fee00000-0x00000000fee00fff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x00000000ff000000-0x00000000ffffffff] reserved
Oct 28 11:22:13 neth kernel: BIOS-e820: [mem 0x0000000100000000-0x000000026dffffff] usable
Oct 28 11:22:13 neth kernel: NX (Execute Disable) protection: active
Oct 28 11:22:13 neth kernel: SMBIOS 2.8 present.
Oct 28 11:22:13 neth kernel: e820: last_pfn = 0x26e000 max_arch_pfn = 0x400000000
Oct 28 11:22:13 neth kernel: x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
Oct 28 11:22:13 neth kernel: e820: last_pfn = 0x8c500 max_arch_pfn = 0x400000000
Oct 28 11:22:13 neth kernel: found SMP MP-table at [mem 0x000fcdd0-0x000fcddf] mapped at [ffff8800000fcdd0]
Oct 28 11:22:13 neth kernel: Using GB pages for direct mapping
Oct 28 11:22:13 neth kernel: RAMDISK: [mem 0x355f0000-0x36aeffff]
Oct 28 11:22:13 neth kernel: Early table checksum verification disabled
Oct 28 11:22:13 neth kernel: ACPI: RSDP 00000000000f05b0 00024 (v02 DELL )
Oct 28 11:22:13 neth kernel: ACPI: XSDT 000000008b8fd0b0 000DC (v01 DELL CBX3 01072009 AMI 00010013)
Oct 28 11:22:13 neth kernel: ACPI: FACP 000000008b921230 0010C (v05 DELL CBX3 01072009 AMI 00010013)
Oct 28 11:22:13 neth kernel: ACPI: DSDT 000000008b8fd218 24016 (v02 DELL CBX3 01072009 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: FACS 000000008bfa2f80 00040
Oct 28 11:22:13 neth kernel: ACPI: APIC 000000008b921340 00084 (v03 DELL CBX3 01072009 AMI 00010013)
Oct 28 11:22:13 neth kernel: ACPI: FPDT 000000008b9213c8 00044 (v01 DELL CBX3 01072009 AMI 00010013)
Oct 28 11:22:13 neth kernel: ACPI: FIDT 000000008b921410 0009C (v01 DELL CBX3 01072009 AMI 00010013)
Oct 28 11:22:13 neth kernel: ACPI: MCFG 000000008b9214b0 0003C (v01 DELL CBX3 01072009 MSFT 00000097)
Oct 28 11:22:13 neth kernel: ACPI: HPET 000000008b9214f0 00038 (v01 DELL CBX3 01072009 AMI. 0005000B)
Oct 28 11:22:13 neth kernel: ACPI: SSDT 000000008b921528 0036D (v01 SataRe SataTabl 00001000 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: SSDT 000000008b921898 0546C (v02 SaSsdt SaSsdt 00003000 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: UEFI 000000008b926d08 00042 (v01 00000000 00000000)
Oct 28 11:22:13 neth kernel: ACPI: LPIT 000000008b926d50 00094 (v01 INTEL SKL 00000000 MSFT 0000005F)
Oct 28 11:22:13 neth kernel: ACPI: SSDT 000000008b926de8 00615 (v02 INTEL DELL__Bl 00000000 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: SSDT 000000008b927400 00248 (v02 INTEL sensrhub 00000000 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: SSDT 000000008b927648 02BAE (v02 INTEL PtidDevc 00001000 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: SSDT 000000008b92a1f8 00BE3 (v02 INTEL Ther_Rvp 00001000 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: DBGP 000000008b92ade0 00034 (v01 INTEL 00000000 MSFT 0000005F)
Oct 28 11:22:13 neth kernel: ACPI: DBG2 000000008b92ae18 00054 (v00 INTEL 00000000 MSFT 0000005F)
Oct 28 11:22:13 neth kernel: ACPI: SSDT 000000008b92ae70 00E73 (v02 CpuRef CpuSsdt 00003000 INTL 20120913)
Oct 28 11:22:13 neth kernel: ACPI: DMAR 000000008b92bce8 000A8 (v01 INTEL SKL 00000001 INTL 00000001)
Oct 28 11:22:13 neth kernel: ACPI: ASF! 000000008b92bd90 000A5 (v32 INTEL HCG 00000001 TFSM 000F4240)
Oct 28 11:22:13 neth kernel: ACPI: EINJ 000000008b92be38 00130 (v01 AMI AMI.EINJ 00000000 AMI. 00000000)
Oct 28 11:22:13 neth kernel: ACPI: ERST 000000008b92bf68 00230 (v01 AMIER AMI.ERST 00000000 AMI. 00000000)
Oct 28 11:22:13 neth kernel: ACPI: BERT 000000008b92c198 00030 (v01 AMI AMI.BERT 00000000 AMI. 00000000)
Oct 28 11:22:13 neth kernel: ACPI: HEST 000000008b92c1c8 000A8 (v01 AMI AMI.HEST 00000000 AMI. 00000000)


(Michael Träumner) #15

Hi,

It looks like your firewall is blocking port 137 for NetBIOS. Please open it and try again with the virtual network activated.
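To see which destinations and ports nmbd fails to reach, the "Packet send failed" lines can be tallied. The Python sketch below is only an illustration: the regular expression is modelled on the log lines posted in this thread, and the function name `count_send_failures` is made up.

```python
import re
from collections import Counter

# Pattern modelled on the nmbd lines in this thread, e.g.
# "Packet send failed to 192.168.122.255(137) ERRNO=Operation not permitted"
SEND_FAILED = re.compile(r"Packet send failed to ([\d.]+)\((\d+)\) ERRNO=(.*)")


def count_send_failures(lines):
    """Count failed nmbd sends per (destination, port, reason)."""
    counts = Counter()
    for line in lines:
        match = SEND_FAILED.search(line)
        if match:
            dest, port, reason = match.groups()
            counts[(dest, int(port), reason.strip())] += 1
    return counts
```

In the excerpts above, the failures go to ports 137 and 138 on 192.168.122.255, so both NetBIOS ports may be worth checking in the firewall rules and not only 137.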