Forum Replies Created
-
AuthorPosts
-
January 17, 2025 at 4:39 pm in reply to: L2 Interfaces on my slice transitioning to DOWN State #8097
Subject: Network Configuration Issue on Slice VMs
Hi Prateek,
I checked your Slice. Could you share the VMs and sites where the network configuration was lost?
The WASH and STAR site workers were rebooted due to another issue, which may have caused this disruption. Please note that, in the current version, fablib configures interfaces using ip commands, which are not persistent across reboots. We are working on making this configuration persist across reboots. In the meantime, please consider using NetworkManager or netplan to configure the interfaces in a way that persists after a reboot.
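As a rough illustration of the netplan route, the sketch below renders a minimal netplan file for one interface with a static address. The helper name render_netplan, the interface name enp7s0, and the address 192.168.10.5/24 are placeholders for illustration, not values taken from your slice. The output would normally be saved under /etc/netplan/ on the VM and activated with "netplan apply".

```python
def render_netplan(iface: str, cidr: str) -> str:
    """Return a minimal netplan YAML document assigning one static address.

    Both arguments are placeholders for illustration; substitute the
    interface name and address that your slice actually uses.
    """
    lines = [
        "network:",
        "  version: 2",
        "  ethernets:",
        f"    {iface}:",
        "      addresses:",
        f"        - {cidr}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical values; print the YAML so it can be copied to the VM.
    print(render_netplan("enp7s0", "192.168.10.5/24"), end="")
```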
Additionally, we are addressing the underlying issue that required the worker node reboots.
Apologies for the inconvenience, and thank you for your patience!
Best,
Komal
Hi Nirmala,
Could you please share your slice id?
Thanks,
Komal
Hi Yuanjun,
There was leaked configuration on the switch, which has now been cleared with help from the Network Team. Could you please try your slice again? Please let us know if you still see the issue.
Thanks,
Komal
User has confirmed in another post that this issue has been resolved.
January 17, 2025 at 9:03 am in reply to: SSH Connection Error: ChannelException(2, ‘Connect failed’) #8084
User has confirmed that this is no longer an issue.
Hi Komal,
The problem has been solved. I believe we do not need the meeting today.
Thank you so much for your help!
Best Regards,
Yuanjun Dai
January 17, 2025 at 9:02 am in reply to: Real-Time Update Issue in fablib.show_site API After Resource Release #8083
Subject: API Behavior Verification
Hi Yuanjun,
I have verified this, and the API is functioning as intended.
fablib.list_sites() retrieves resource information from the testbed. Once fetched, users can display data for a specific site using fablib.show_site(). Please note that fablib.show_site() only presents site information that was previously retrieved via fablib.list_sites().
To refresh the resource information, you need to call fablib.list_sites(update=True), followed by fablib.show_site().
Please check this in your code, and let us know if you continue to experience any issues.
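A minimal sketch of that refresh-then-show sequence follows. The helper name show_site_refreshed is made up for illustration, and passing the site name as an argument to show_site() is an assumption of this sketch; the fablib argument is expected to be an already constructed FablibManager.

```python
def show_site_refreshed(fablib, site_name):
    """Refresh the cached resource information, then display one site.

    `fablib` is expected to be an already constructed FablibManager;
    passing `site_name` to show_site() is an assumption of this sketch.
    """
    # Re-fetch resource information from the testbed.
    fablib.list_sites(update=True)
    # Display the site from the data that was just refreshed.
    return fablib.show_site(site_name)
```

For example, calling show_site_refreshed(fablib, "WASH") right after releasing resources should avoid reading the site from the earlier list_sites() result.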
Thanks,
Komal
January 16, 2025 at 3:38 pm in reply to: SSH Connection Error: ChannelException(2, ‘Connect failed’) #8076
Hi Yuanjun,
Both this issue and the upload_file issue point to a problem with SSH access to the VMs. The cause could be expired bastion keys or a configuration issue.
Would a quick Zoom meeting to resolve this be possible? Please let me know if that works.
Thanks,
Komal
Hi Ilya,
I checked the host and see a similar output. Will check with Mert regarding the reboot.
0000:25:00.0 Network controller: Xilinx Corporation Device 903f
Subsystem: Xilinx Corporation Device 0007
Physical Slot: 2-1
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-SERR-
Thanks,
Komal
Hi Ilya,
Hussam helped recover the WASH worker. Could you please try your slice again on WASH?
Thanks,
Komal
Could you please check for any error messages in the /tmp/fablib/fablib.log file when upload_file() is invoked? If possible, please share this file.
This file should exist on the system from which fablib is being invoked.
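One quick way to pull out the relevant entries before sharing the file is sketched below. The helper name find_error_lines is hypothetical, and matching on the substring "ERROR" is an assumption about how the log entries are written, so adjust the marker if your log looks different.

```python
from pathlib import Path


def find_error_lines(log_path="/tmp/fablib/fablib.log", marker="ERROR"):
    """Return the lines of the log file that contain `marker`.

    Matching on the substring "ERROR" is an assumption about the log
    format; adjust `marker` if the entries look different.
    """
    path = Path(log_path)
    if not path.exists():
        return []  # No log file at this location.
    with path.open(errors="replace") as handle:
        return [line.rstrip("\n") for line in handle if marker in line]


if __name__ == "__main__":
    for line in find_error_lines():
        print(line)
```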
Thanks,
Komal
January 16, 2025 at 10:40 am in reply to: Real-Time Update Issue in fablib.show_site API After Resource Release #8063
Thank you for sharing your observation. I will investigate this further and share an update.
Thanks,
Komal
Could you run fablib.show_config() to display the log file in use?
Additionally, you can specify the log file location in the fabric_rc file, which may assist in debugging. Upload failures often occur if the bastion keys have expired, so please verify that your bastion keys are still valid.
Thanks,
Komal
January 16, 2025 at 10:36 am in reply to: Real-Time Update Issue in fablib.show_site API After Resource Release #8060
Hi Yuanjun,
This behavior is intentional; resource information is retrieved from cached data. Due to the scale of the system, updating it after each create or delete operation would be resource-intensive. Instead, the cache is refreshed at regular intervals, currently every 30 minutes.
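To make the effect of interval-based caching concrete, here is a generic sketch of a time-based cache. It is not the testbed's actual implementation; the class name TimedCache and its parameters are made up for illustration. It shows why a read shortly after a create or delete can still return the older data.

```python
import time


class TimedCache:
    """Serve a stored value and re-fetch it only after `ttl_seconds`."""

    def __init__(self, fetch, ttl_seconds=30 * 60, clock=time.monotonic):
        self._fetch = fetch        # callable that queries the real source
        self._ttl = ttl_seconds    # default of 30 minutes, as described above
        self._clock = clock        # injectable so the behavior can be tested
        self._value = None
        self._fetched_at = None

    def get(self):
        now = self._clock()
        expired = self._fetched_at is None or now - self._fetched_at >= self._ttl
        if expired:
            # Only here does the cache go back to the underlying source.
            self._value = self._fetch()
            self._fetched_at = now
        return self._value
```

Any change made to the underlying source between two refreshes stays invisible to get() until the interval has elapsed.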
Thanks,
Komal
The /tmp/fablib/fablib.log log file exists on the JH container. Could you please check it and share it?
Thanks,
Komal
Hi Yuanjun,
Could you please try using the upload_directory API and let us know if that works?
Also, are you able to SSH to the VMs? Could you please check whether you see any errors in /tmp/fablib/fablib.log? If so, please share the log file.
Thanks,
Komal