SG3452 disables "OMADA Management" settings by itself or buggy OMADA infos - recovery is a mess
Dear OMADA gurus,
i have a scenario with a SG3254, OMADA on DEBIAN , a couple of other OMADA switches and some EAPs. So far all went very good. I used all available LAGG on the SG3452 groups with LACP to connect ESX, Proxmox, Firewall, etc. A number of VLANs, etc. etc. All of a sudden - during a configuration the SG lost connection to the OMADA controller (5.0.30). After some research to ensure that is is not a VLAN config failure I took a look with console / CLI and ...
=> CLI shows up with - Cloud-Based Controller Management: Disabled - magically change of this parameter ...
Network is running without any problems with the current configuration while the switch is disconnected - reboot(s) no change - so no chance to put the SG back into OMADA. The issue here is that it is impossible to put the Paramter back to the right direction because if the switch is under OMADA Control you can't access the appropriate config mode in CLI. So this is a catch22 situation.
As I had daily OMADA Backups I thought to go with a reset of the Switch via CLI and recover the last good known running config. Therefore I prepared a "special" subnet / DHCP / etc. with the appropriate Parameter so that the SG could find the OMADA after reset based on the device factory network 192.168.0.x/24. But the problem is after resetting the SG with all the LAG / LACP cabling the switch totally go crazy with >90% CPU load according to the "now" messy cabling if the switch comes up fresh from the reset.
Even trying to set quickly the respective ports down with CLI is in fact not possible because the SG CPU is > 90% and not really responsive at all. So the solution is to plug out most of the cables and then start the recovery procedure.
After understanding all this the recovery worked but as a conclusion I would like to state two things:
1.) Why or under which conditions does the Switch change the OMADA Paramter and drop itself out of the game - I would say > heavy BUG
2.) In case of recovery needs there should be some kind of accessible settings via CLI even the device is under OMADA control or a kind of boot/rescue mode with putting ip/dns/gateway, may be to de/activate ports for rescue procedure to avoid switch load because of the cabling issue which are good for original configuration OMADA settings in. Being forced to plug out all cabling is a mess. Funny enough firstly I picked a too young backup file where the switch went from adopting => configuring to disconnected - sp the same procedure again with all the cable mess. So I would suggest something strange happens or is in the config file.
These are my 2 cents
Any fruitful feedback would be highly appreciated.
Kind regards: a still convinced OMADA user ;-)