Forum Replies Created
-
AuthorPosts
-
Hello,
Could you please ensure that
configure_environment.ipynb
notebook has been executed and your environment is setup? Also, please remove any entries forfabrictestbed-extensions
you may have infabric_config/requirements.txt
and restart your container and attempt creating the slice again?Thanks,
Komal
June 8, 2023 at 5:37 pm in reply to: Maintenance on FABRIC-Credential Manager – 06/08/2023 (5:00pm -5:30pm EST) #4515Credential Manager has been updated and the maintenance is complete!
June 8, 2023 at 10:34 am in reply to: Maintenance on JupyterHub -06/08/2023 – 10:00 am – 10:30 am #4507SSL certs updated and maintenance is complete.
June 6, 2023 at 9:38 pm in reply to: Fail to launch “default” JupyterHub server for brand new account #4488Thank you for reporting this! It’s a bug, for now I have patched the JH with the fix. I will push it to the github repo as well.
Thanks,
Komal
June 1, 2023 at 6:09 pm in reply to: Maintenance on FABRIC JupyterHub – 06/1/2023 – 1:00 pm – 4:00 pm #4445Closing the topic!
June 1, 2023 at 6:09 pm in reply to: Maintenance on FABRIC JupyterHub – 06/1/2023 – 1:00 pm – 4:00 pm #4444Maintenance is complete. Jupyter Hub has been updated to provide following different container options:
- FABRIC Examples v1.4.3, FABlib v1.4.1: released: 04/23/2023, stable: Spring 2023
- (default) FABRIC Examples v1.4.6, FABlib v1.4.4: released: 05/21/2023, stable: Summer 2023
- (bleeding edge) FABRIC Examples latest, FABlib latest, Release 1.4
Thanks,
Komal
May 24, 2023 at 5:54 pm in reply to: Get_slice and list_nodes execution takes more than 10 minutes. #4311Hi Nagmat,
Could you please check if your bastion keys are still valid? Also, could you please share and check for errors in
/tmp/fabric/fablib.log
?Thanks,
KomalHi Nagmat,
I don’t see any failures for this slice either. It seems like this slice was deleted on 2023-05-16 17:31:42 UTC.
However, I do see that, another slice created by you yesterday failed with “Insufficient resources error” which would explain the notebook behavior.
Reservation ID: fb7939b5-8f8b-4517-a975-3da648a190c3 Slice ID: 55c64491-4edc-44cc-885f-264f9fa4cbc2 Resource Type: VM Notices: Reservation fb7939b5-8f8b-4517-a975-3da648a190c3 (Slice basic_nagm01(55c64491-4edc-44cc-885f-264f9fa4cbc2) Graph Id:348882a5-575c-438e-a3aa-18c1c933794e Owner:nagmat@nevada.unr.edu) is in state (Closed,None_) (Last ticket update: Insufficient resources : Component of type: ConnectX-5 not available in graph node: 8QTDZC3) Start: 2023-05-22 19:28:49 +0000 End: 2023-05-23 19:28:47 +0000 Requested End: 2023-05-23 19:28:47 +0000
Thanks,
KomalHi Nagmat,
Could you please share slice id for the slice(s) if possible? Also, you could check in Portal under Experiment->Slices, please select “Include Dead/Closing” slices. Any errors that occurred in those slices should be displayed when you view the slice on the portal.
Thanks,
KomalMay 21, 2023 at 4:19 pm in reply to: Maintenance on FABRIC Orchestrator/Broker – 05/19/2023 – 4:00pm-5:00pm #4295Maintenance was completed!
May 18, 2023 at 9:52 am in reply to: [Maintenance] 05/18/2023 10:00 – 11:00 am (EDT) – orchestrator/broker update #4276The maintenance has been completed.
Apologies I missed the screenshot, I see that you are trying to upload to the node without doing a get on it.
Could you please try adding the following statements before upload to ensure that node object has all the information from the slice?
s1 = slice.get_node(name="s1") s3 = slice.get_node(name="s3")
The above statements should be added after the cell which does a
get_slice()
.Also regarding the failed VM:
h2
– unfortunately it is again because of disk un-availability.s2
andh2
are both requested on MICH and each VM is requesting 500G. Based on current allocation of MICH, only one can be provisioned.This is because of a known issue: availability discrepancy between software and infrastructure. We are working on the plans to resolve this, until then if possible request VMs with smaller disks.
Thanks,
KomalHi Nagmat,
You have 4 active slices:
Slice Name: mri_ded3 Slice ID: d54fd501-ca14-49ce-b217-50c593bd0927 Project ID: 527832fc-c273-4254-b988-16e5c2923bf9 Project Name: in-network caching Slice Name: mri1 Slice ID: 6ce8648d-26a9-47b8-8349-4e3e724085a0 Project ID: 527832fc-c273-4254-b988-16e5c2923bf9 Project Name: in-network caching Slice Name: mri3 Slice ID: fa527279-b9bb-4dac-bb68-7a46b63c8ad1 Project ID: 527832fc-c273-4254-b988-16e5c2923bf9 Project Name: in-network caching Slice Name: Nagm_P4Test01 Slice ID: 140247fe-cb45-47da-a797-dca5592487dd Project ID: 527832fc-c273-4254-b988-16e5c2923bf9 Project Name: in-network caching
The last slice
Nagm_P4Test01
has two VM slivers on FIU and the VM slivers are in ActiveTicketed state – which means a pending Renew. We have been observing network issues with FIU and working to resolve that since yesterday. FIU was moved to maintenance as well. I do see management IPs for these VMs but it is possible that they are not accessible because of the network issue.All other slices have all the VMs and Networks in Active state. If you are observing failure with these slices, please do a
slice = fablib.get_slice(slice_name)
and then try to upload the file.Thanks,
Komal- This reply was modified 1 year, 6 months ago by Komal Thareja.
Thank you Paul!
@Nagmat,
Could you please share the slice id for your slice?Thanks,
KomalMay 10, 2023 at 4:38 pm in reply to: I am getting “Playbook has failed tasks: Error in creating the server” on FIU. #4199Hi Nagmat,
I looked at your slice and the error reported above, your slice is requesting more than one VM on FIU and they are being allocated to
fiu-w3.fabric-testbed.net
. All the VMs are of the flavor:fabric.c8.m16.d500
. Current disk availability onfiu-w3.fabric-testbed.net
can only accomodate one such VM so all others fail.We do have known discrepancy in the disk availability between the software and the infrastructure. We have plans to address that soon. Could you please use a different site instead of FIU for now?
Appreciate your feedback!
Thanks,
Komal -
AuthorPosts