Komal Thareja

Forum Replies Created

Viewing 15 posts - 1 through 15 (of 554 total)

1 2 3 … 35 36 37 →

Author

Posts
May 29, 2026 at 2:17 pm in reply to: Unable to create multisite slices and resource capped at 2 cores and 10GB #9837
Komal Thareja
Moderator
Service Project only gives you access to the Service. Please create your slice with LAMB project.

Best,

Komal
May 29, 2026 at 1:53 pm in reply to: FABRIC Distributed Storage is now available for active users! #9835
Komal Thareja
Moderator
Hi Vinaya,

Could you please your slice try again?

Please post any further questions/concerns here: https://learn.fabric-testbed.net/forums/forum/fabric-general-questions-and-discussion/

Best,

Komal
May 29, 2026 at 12:40 pm in reply to: Slice stuck on ‘Configuring’ for 4 days after extending #9832
Komal Thareja
Moderator
Hi Seena,

We recently had a Kafka outage (details here: https://learn.fabric-testbed.net/forums/topic/service-update-kafka-outage/). Unfortunately this caused your Renew to only partially succeed, leaving your slice stuck in the Configuring state. Your VMs are still set to expire on 06/03.

There are two ways we can resolve this, both done administratively on our end:

1. Delete the slice and re-provision it, or
2. Force the state to StableOK.

One thing to flag: a further slice extension may not be possible in either case.

Let me know how you’d like to proceed, and apologies for the inconvenience.

Best,
Komal
May 29, 2026 at 11:40 am in reply to: FABRIC Distributed Storage is now available for active users! #9831
Komal Thareja
Moderator
Hi Sree,

You should be able to pass the flag storage=True in multiple slices and still have the same volume mounted in both the slices simultaneously. Please feel free to reach out if you run into issues. Please post on any further questions here: https://learn.fabric-testbed.net/forums/forum/fabric-general-questions-and-discussion/

Best,

Komal
May 27, 2026 at 1:35 pm in reply to: Operations on Slices taking time #9810
Komal Thareja
Moderator
Thank you for reporting this Nirmala! System has been recovered. Please let us know if you continue to run into issues.

Best,

Komal
May 27, 2026 at 1:18 pm in reply to: Service Update — Kafka Outage #9809
Komal Thareja
Moderator
closing the topic!
May 27, 2026 at 1:17 pm in reply to: Service Update — Kafka Outage #9808
Komal Thareja
Moderator
Dear Users,

The issue has been resolved and service has been fully restored. We apologize for the inconvenience caused.

Happy Experimenting!

Best,

Komal
May 27, 2026 at 11:22 am in reply to: Operations on Slices taking time #9805
Komal Thareja
Moderator
Hi Nirmala,

We’re currently investigating what appears to be an unplanned outage on our Kafka service, which may be causing the slowness you’re experiencing. We are actively working on recovery, and I’ll keep you updated on our progress.

Best, Komal
May 27, 2026 at 10:37 am in reply to: Operations on Slices taking time #9803
Komal Thareja
Moderator
Hi Nirmala,

Could you please check if you see any errors in /tmp/fablib/fablib.log ?

Another thing for the Post Boot config delays could be expired bastion keys. Please run the notebook jupyter-examples-*/configure_and_validate/configure_and_validate.ipynb This shall renew your bastion keys if they are expired.

Please remember to update the Project ID in this notebook.

Please let me know in case you continue to run into issues.

Best,

Komal
May 10, 2026 at 10:59 am in reply to: UDP performance tuning for ubuntu 24.04 #9776
Komal Thareja
Moderator
Hi Jacob,

Take a look at this artifact. While it focuses on TCP performance, it also covers OS tuning and CPU pinning / NUMA tuning, both of which should help with your performance work.

One other thing worth considering is the type of NIC you’re using. Basic (virtual) NICs likely won’t give you peak performance — NIC_ConnectX-6 or NIC_ConnectX-5 would be much better candidates.

Best,
Komal
- This reply was modified 3 weeks, 4 days ago by Komal Thareja.
May 4, 2026 at 3:47 pm in reply to: node.add_fabnet() raises ResourceNotFoundError #9753
Komal Thareja
Moderator
Hi Arash,

Fix has been deployed on beyond bleeding edge container. Will be available in bleeding edge container later this evening. Please let me know if you run into any more issues. Apologies for the inconvenience.

Best,

Komal
May 4, 2026 at 2:06 pm in reply to: node.add_fabnet() raises ResourceNotFoundError #9752
Komal Thareja
Moderator
Hi Arash,

I’m looking at this will push out a fix soon.

Best,

Komal
April 27, 2026 at 9:51 am in reply to: Cannot allocate GPU + ConnectX-6 on same node #9727
Komal Thareja
Moderator
Portal view has been fixed too! Portal now shows the state of resources correctly.

Best,

Komal
April 26, 2026 at 4:30 pm in reply to: Cannot allocate GPU + ConnectX-6 on same node #9726
Komal Thareja
Moderator
Hi Bek,

Just a heads-up — the resource status on the portal isn’t quite matching the actual state of the resources right now. I’m working to get that sorted, but in the meantime you can use the fablib API to check availability and find an open slot for your target slice.

Here’s an artifact that should come in handy: https://artifacts.fabric-testbed.net/artifacts/e777ce3a-5b40-4e58-9666-7f31f655f03c

Best,

Komal
April 22, 2026 at 11:54 am in reply to: Request to Extend Slice Lease – unable to do it from portal #9702
Komal Thareja
Moderator
Hi Sree,

I’m investigating the extend/renew of this slice. That said, I’d strongly recommend backing up your data in the meantime — that way, if the slice ever needs to be recreated, you’ll have everything you need on hand.

Best,
Komal
Author

Posts

Viewing 15 posts - 1 through 15 (of 554 total)

1 2 3 … 35 36 37 →