Forum Replies Created
Maintenance is completed.
Details are provided in a separate post: https://learn.fabric-testbed.net/forums/topic/fabric-testbed-is-open-and-ready-for-use/
FABRIC is under maintenance. Details are at https://learn.fabric-testbed.net/forums/topic/multi-day-fabric-maintenance-january-1-5-2024/
That’s right, it should be in the Maintenance state rather than Active. I understand your frustration. We do our best to set the states properly; this time, however, I sent the announcement but forgot to change the state. I hope this helps clarify the situation for PSC.
Hi Mami,
There is network maintenance at PSC:
https://learn.fabric-testbed.net/forums/topic/maintenance-on-fabric-psc-between-december-19-20-short-notice/
FABRIC-PSC should not be used until we announce the end of maintenance.
You can create slices until 12/29 23:59 UTC; after that date, new slices cannot be created. All existing slices will stay active until Jan 1st 5 pm EST, after which we will start deleting them.
It is safer to assume that all slivers (VMs, dataplane services) of a slice touching any of these 4 sites will be deleted during this maintenance. For example, if you have a slice connecting STAR with other sites that will continue operating (e.g. PSC and GPN), you should expect the entire slice, with all VMs running on STAR, PSC, and GPN, to be deleted.
This maintenance has several complications, so we cannot guarantee the health of slices touching these 4 sites. Apologies for this inconvenience.
Hello Yoursunny,
Yes, we will have maintenance and switch upgrades/fiber work at both the WASH and STAR locations; therefore, FABNetv4Ext will be affected during this maintenance.
November 22, 2023 at 11:35 am, in reply to: Workers with 100G SmartNICs in maintenance mode on multiple sites #6126
Dear Experimenters,
We have completed this maintenance; the worker nodes mentioned for the sites listed in the previous message are available for experiments.
None of the VMs or reservations running on the workers with 100G ConnectX-6 SmartNICs were affected during the work.
November 18, 2023 at 4:27 pm, in reply to: RESOLVED: FABRIC MASS in Maintenance due to unexpected power loss at hosting site #6120
The power outage is resolved. All active slivers are restored.
Please let us know if you have any issues with your existing slivers on FABRIC-MASS.
November 9, 2023 at 1:31 pm, in reply to: “channel 0: open failed: connect failed: No route to host” Error #6082
The FASTnet worker at KANS (kans-w2.fabric-testbed.net) encountered a hardware (driver) problem, and all VMs on it were rebooted. All network interfaces have been re-attached to the VMs, although IP address configurations may need to be reconfigured on them, as sketched below.
For this specific hardware problem, we will work on remedies to prevent such incidents.
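If an interface came back up without its dataplane address, it can be re-added by hand. The commands below are only a minimal sketch; the interface name, address, and prefix length are placeholders, so substitute the values from your original slice configuration.
sudo ip link set dev ens8 up             # bring the re-attached interface up
sudo ip addr add 10.0.0.2/24 dev ens8    # hypothetical experiment address
ip addr show dev ens8                    # verify the address is in place
The same applies to any other experiment interface that lost its static configuration after the reboot.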
Dear experimenters,
The issue is fixed and the maintenance has been lifted. The testbed is back to normal operation for all services.
Dear experimenters,
Issues are resolved. FABRIC-MICH is available for experiments.
The cooling system at the datacenter was fixed without any impact on the FABRIC-FIU site.
FABRIC-FIU resources are available for slices.
STAR is online.
Interfaces on node1 are re-attached to their original devices.
ubuntu@node1:~$ ip a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: ens3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9000 qdisc fq_codel state UP group default qlen 1000
link/ether fa:16:3e:1c:38:5f brd ff:ff:ff:ff:ff:ff
inet 10.30.6.43/23 brd 10.30.7.255 scope global dynamic ens3
valid_lft 54283sec preferred_lft 54283sec
inet6 2001:400:a100:3090:f816:3eff:fe1c:385f/64 scope global dynamic mngtmpaddr noprefixroute
valid_lft 86315sec preferred_lft 14315sec
inet6 fe80::f816:3eff:fe1c:385f/64 scope link
valid_lft forever preferred_lft forever
3: ens7: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 02:7f:ae:44:cb:c9 brd ff:ff:ff:ff:ff:ff
4: ens8: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
link/ether 02:bc:a6:3f:c7:cb brd ff:ff:ff:ff:ff:ff
inet6 fe80::bc:a6ff:fe3f:c7cb/64 scope link
valid_lft forever preferred_lft forever
5: ens9: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
link/ether 06:e3:d6:00:5b:06 brd ff:ff:ff:ff:ff:ff
inet6 fe80::4e3:d6ff:fe00:5b06/64 scope link
valid_lft forever preferred_lft forever
Fengping,
I checked the VMs on this slice (their IPs are listed below) and all of them have the sliver key shown here:
ecdsa-sha2-nistp256 AAAAE2VjZHNhLXNo … nrsc4= sliver
On the FABRIC bastion hosts, I see the following (bastion) key:
ecdsa-sha2-nistp256 AAAAE2VjZ … frtHLo= bastion_
You can check your ssh configuration and sliver key accordingly.
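As one example, a jump-host setup along these lines usually works; this is only a sketch, and the bastion hostname, usernames, and key file names are assumptions, so replace them with the values from your FABRIC portal account.
cat >> ~/.ssh/config <<'EOF'
Host fabric-bastion
    HostName bastion.fabric-testbed.net     # assumed bastion address
    User YOUR_BASTION_USERNAME              # placeholder
    IdentityFile ~/.ssh/fabric_bastion_key  # private key matching the bastion public key above
Host fabric-vm
    HostName 2001:400:a100:3090:f816:3eff:fe1c:385f  # one of the VM IPs listed below
    User ubuntu
    IdentityFile ~/.ssh/fabric_sliver_key   # private key matching the sliver public key above
    ProxyJump fabric-bastion
EOF
ssh fabric-vm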
For the FABNetv6Ext network, all VMs have their IPs in place and could ping the default gateway in their subnet (2602:fcfb:1d:2::/64). These IPs are also receiving traffic from external sources, so they appear to be in good health.
However, I could not ping the IP you mentioned in the peering subnet (e.g. 2602:fcfb:1d:3::/64). My visibility into this peering subnet is limited and I’m not sure where those addresses are active. I notified our network team about this.
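If you want to repeat the gateway check from inside one of your VMs, something like the command below should do. The gateway address here assumes the first address of the FABNetv6Ext subnet, which may not match your slice, so use the gateway reported for your network service.
ping -6 -c 3 2602:fcfb:1d:2::1    # assumed gateway address for the FABNetv6Ext subnet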
VMs:
2001:400:a100:3090:f816:3eff:fe56:acb7
2001:400:a100:3090:f816:3eff:fe80:bfc7
2001:400:a100:3090:f816:3eff:fe9c:3e41
2001:400:a100:3090:f816:3eff:fee3:ef05
2001:400:a100:3090:f816:3eff:fe8b:deb0
2001:400:a100:3090:f816:3eff:fe8a:f1d1
2001:400:a100:3090:f816:3eff:fe1c:385f
2001:400:a100:3090:f816:3eff:feaa:161a
2001:400:a100:3090:f816:3eff:fee2:d192
2001:400:a100:3090:f816:3eff:fe31:1eeb