Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Server powered off then back online,Fan Contoller can't detect CPU2 Temperature #90

Closed
Handsome1080P opened this issue May 21, 2024 · 13 comments
Assignees

Comments

@Handsome1080P
Copy link

./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
------- Temperatures -------
Date & time Inlet CPU 1 CPU 2 Exhaust Active fan speed profile Third-party PCIe card Dell default cooling response Comment
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
21-05-2024 07:45:31 27°C 0°C 34°C User static fan control profile (15%)°C Disabled CPU temperature decreased and is now OK (<= 55°C), user's fan control profile applied.
./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
21-05-2024 07:46:31 27°C 0°C 34°C User static fan control profile (15%)°C Disabled -
./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
21-05-2024 07:47:31 27°C 0°C 34°C User static fan control profile (15%)°C Disabled -
./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
21-05-2024 07:48:31 27°C 0°C 34°C User static fan control profile (15%)°C Disabled -
./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
21-05-2024 07:49:34 27°C 0°C 34°C User static fan control profile (15%)°C Disabled -
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
21-05-2024 07:50:36 27°C 39°C -°C 34°C User static fan control profile (15%) Disabled -
21-05-2024 07:51:32 27°C 46°C -°C 33°C User static fan control profile (15%) Disabled -
21-05-2024 07:52:42 27°C 43°C -°C 33°C User static fan control profile (15%) Disabled -
21-05-2024 07:53:35 27°C 45°C -°C 34°C User static fan control profile (15%) Disabled -
21-05-2024 07:54:32 27°C 53°C -°C 34°C User static fan control profile (15%) Disabled -

@Handsome1080P Handsome1080P changed the title Server powered off then back online,Fan Contoller can't detected CPU2 Temperature Server powered off then back online,Fan Contoller can't detecte CPU2 Temperature May 21, 2024
@Handsome1080P Handsome1080P changed the title Server powered off then back online,Fan Contoller can't detecte CPU2 Temperature Server powered off then back online,Fan Contoller can't detect CPU2 Temperature May 21, 2024
@JonathanNathanson
Copy link

Simmilar messages here:

Fan speed objective: 5%
CPU temperature threshold: 50°C
Check interval: 30s

No exhaust temperature sensor detected.
No CPU2 temperature sensor detected.

./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
Unable to send RAW command (channel=0x0 netfn=0x30 lun=0x0 cmd=0xce rsp=0xc1): Invalid command
                     ------- Temperatures -------
    Date & time      Inlet  CPU 1  CPU 2  Exhaust          Active fan speed profile          Third-party PCIe card Dell default cooling response  Comment
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
24-05-2024 10:20:50    0°C    0°C  User static fan control profile (5%)°C  Enabled°C  CPU temperature decreased and is now OK (<= 50°C), user's fan control profile applied.                                                       
./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
Unable to send RAW command (channel=0x0 netfn=0x30 lun=0x0 cmd=0xce rsp=0xc1): Invalid command
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
./Dell_iDRAC_fan_controller.sh: line 159: printf: -: invalid number
24-05-2024 10:21:20    0°C    0°C  User static fan control profile (5%)°C  Enabled°C                                         -                                                       
./Dell_iDRAC_fan_controller.sh: line 94: [: -gt: unary operator expected
Unable to send RAW command (channel=0x0 netfn=0x30 lun=0x0 cmd=0xce rsp=0xc1): Invalid command
...

@tigerblue77 tigerblue77 self-assigned this Nov 10, 2024
@tigerblue77
Copy link
Owner

Hello @Handsome1080P, sorry for the delay of my reply. Do you still encounter the same error ? Did you find any way to fix it ? What happens if you restart or recreate the container ?

Hello @JonathanNathanson, your problem looks different, please open a new issue so we can look at it.

@tigerblue77
Copy link
Owner

The errors message u can ignore.

No, I'd like to get them fixed if you can help me diagnose them ofc ?

but I don't know when server temp over what I set, reset to dynamic profile is working or not.

Well, an easy test would be to check your usual CPU temp, let's say it's 40 to 42°c, you set your threshold at 42°C and see if it switches to Dell dynamic profile 😉

@Handsome1080P
Copy link
Author

The errors message u can ignore.

No, I'd like to get them fixed if you can help me diagnose them ofc ?

but I don't know when server temp over what I set, reset to dynamic profile is working or not.

Well, an easy test would be to check your usual CPU temp, let's say it's 40 to 42°c, you set your threshold at 42°C and see if it switches to Dell dynamic profile 😉

Just disconnect IPMI cable for several seconds,then put it back.u will see this logs in console.But now I removed second CPU,so I cannot do something for u.
About the automatic switch to default setting function.I didn't heard fan's RPM up to normal after temperature over settings.

@Handsome1080P
Copy link
Author

The errors message u can ignore.

No, I'd like to get them fixed if you can help me diagnose them ofc ?

but I don't know when server temp over what I set, reset to dynamic profile is working or not.

Well, an easy test would be to check your usual CPU temp, let's say it's 40 to 42°c, you set your threshold at 42°C and see if it switches to Dell dynamic profile 😉

Just disconnect IPMI cable for several seconds,then put it back.u will see this logs in console.But now I removed second CPU,so I cannot do something for u. About the automatic switch to default setting function.I didn't heard fan's RPM up to normal after temperature over settings.

seems still hold manually control?

@tigerblue77
Copy link
Owner

Just disconnect IPMI cable for several seconds,then put it back.u will see this logs in console.But now I removed second CPU,so I cannot do something for u.

Is the network wire connected to iDRAC port that you call "IPMI cable" ?

seems still hold manually control?

This is the first time I hear this security function now working. Can you provide the console logs (starting informations + lines where we see the temperature going above the threshold) please ?

@Handsome1080P
Copy link
Author

Just disconnect IPMI cable for several seconds,then put it back.u will see this logs in console.But now I removed second CPU,so I cannot do something for u.

Is the network wire connected to iDRAC port that you call "IPMI cable" ?

seems still hold manually control?

This is the first time I hear this security function now working. Can you provide the console logs (starting informations + lines where we see the temperature going above the threshold) please ?

Now the outside temperature below 20,so can't test.

@Handsome1080P
Copy link
Author

Just disconnect IPMI cable for several seconds,then put it back.u will see this logs in console.But now I removed second CPU,so I cannot do something for u.

Is the network wire connected to iDRAC port that you call "IPMI cable" ?

seems still hold manually control?

This is the first time I hear this security function now working. Can you provide the console logs (starting informations + lines where we see the temperature going above the threshold) please ?

Now the outside temperature below 20,so can't test.

------- Temperatures -------
Date & time Inlet CPU 1 CPU 2 Exhaust Active fan speed profile Third-party PCIe card Dell default cooling response Comment
10-11-2024 14:58:45 21°C 40°C -°C 34°C User static fan control profile (5%) Disabled -
10-11-2024 14:59:49 21°C 39°C -°C 34°C User static fan control profile (5%) Disabled -

@Handsome1080P
Copy link
Author

Just disconnect IPMI cable for several seconds,then put it back.u will see this logs in console.But now I removed second CPU,so I cannot do something for u.

Is the network wire connected to iDRAC port that you call "IPMI cable" ?

seems still hold manually control?

This is the first time I hear this security function now working. Can you provide the console logs (starting informations + lines where we see the temperature going above the threshold) please ?

Now the outside temperature below 20,so can't test.

------- Temperatures ------- Date & time Inlet CPU 1 CPU 2 Exhaust Active fan speed profile Third-party PCIe card Dell default cooling response Comment 10-11-2024 14:58:45 21°C 40°C -°C 34°C User static fan control profile (5%) Disabled - 10-11-2024 14:59:49 21°C 39°C -°C 34°C User static fan control profile (5%) Disabled -

Disabled -
/!\ WARNING /!\ Container stopped, Dell default dynamic fan control profile applied for safety.
iDRAC/IPMI username: root
iDRAC/IPMI password: 634218
Server model: DELL PowerEdge R730
iDRAC/IPMI host: 192.168.20.50
Fan speed objective: 5%
CPU temperature threshold: 50°C
Check interval: 60s

No CPU2 temperature sensor detected.

                 ------- Temperatures -------
Date & time      Inlet  CPU 1  CPU 2  Exhaust          Active fan speed profile          Third-party PCIe card Dell default cooling response  Comment

10-11-2024 15:02:18 21°C 39°C -°C 34°C User static fan control profile (5%) Disabled CPU temperature decreased and is now OK (<= 50°C), user's fan control profile applied.
10-11-2024 15:03:15 21°C 39°C -°C 34°C User static fan control profile (5%) Disabled -

@tigerblue77
Copy link
Owner

Now the outside temperature below 20,so can't test.

You can always test, just adapt the threshold

------- Temperatures -------
Date & time Inlet CPU 1 CPU 2 Exhaust Active fan speed profile Third-party PCIe card Dell default cooling response Comment
10-11-2024 14:58:45 21°C 40°C -°C 34°C User static fan control profile (5%) Disabled -
10-11-2024 14:59:49 21°C 39°C -°C 34°C User static fan control profile (5%) Disabled -

Misses starting informations (first few lines)

@Handsome1080P
Copy link
Author

only container stopped will automatically switch to server control.Fans RPM back to normal (4000+).

@Handsome1080P
Copy link
Author

Now the outside temperature below 20,so can't test.

You can always test, just adapt the threshold

------- Temperatures -------
Date & time Inlet CPU 1 CPU 2 Exhaust Active fan speed profile Third-party PCIe card Dell default cooling response Comment
10-11-2024 14:58:45 21°C 40°C -°C 34°C User static fan control profile (5%) Disabled -
10-11-2024 14:59:49 21°C 39°C -°C 34°C User static fan control profile (5%) Disabled -

Misses starting informations (first few lines)

sorry,now not at home. Remote controlling 😔

@tigerblue77
Copy link
Owner

I'm sorry but I don't have sufficient informations to diagnose. Please get back to me here when you will have the time to give every information and I will re-open the case. Thanks

@tigerblue77 tigerblue77 closed this as not planned Won't fix, can't repro, duplicate, stale Nov 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants