Komal Thareja

Forum Replies Created

Viewing 15 posts - 61 through 75 (of 466 total)
  • in reply to: Slice key issue #8331
    Komal Thareja
    Participant

      Hi Mike,

      This could be because your Bastion Key has expired. Could you please try running the notebook jupyter-examples-rel1.8.1/configure_and_validate/configure_and_validate.ipynb again?

      This should renew the keys. If the error persists, could you please share any errors you see in /tmp/fablib/fablib.log?
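
      One way to pull the relevant entries out of the log before sharing it is sketched below. This is only a sketch: the helper name `find_log_errors` is hypothetical, and it assumes the log marks failures with the standard Python logging level names `ERROR` or `CRITICAL`, which may not hold for every fablib version.

```python
from collections import deque
from pathlib import Path


def find_log_errors(log_path, levels=("ERROR", "CRITICAL"), tail=20):
    """Return up to the last `tail` lines of `log_path` that mention one of `levels`.

    Hypothetical helper; assumes the log uses standard logging level names.
    Returns an empty list if the file does not exist.
    """
    path = Path(log_path)
    if not path.exists():
        return []
    with path.open(errors="replace") as handle:
        matches = (line.rstrip("\n") for line in handle
                   if any(level in line for level in levels))
        # deque(maxlen=...) keeps only the most recent matches
        return list(deque(matches, maxlen=tail))


if __name__ == "__main__":
    for entry in find_log_errors("/tmp/fablib/fablib.log"):
        print(entry)
```

      If this prints nothing, the log either does not exist at that path or contains no lines at those levels, in which case sharing the whole file is more useful.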

      Thanks,

      Komal

      • This reply was modified 5 months, 3 weeks ago by Komal Thareja.
      Komal Thareja
      Participant

        Hi Tejas,

        Could you please share your Slice ID?

        Thanks,

        Komal

        in reply to: Facility ports #8285
        Komal Thareja
        Participant

          Glad to hear it worked, Mike! The default container should work as well; I’ll give it a try.

          The Bleeding Edge container includes the latest released software versions, so I’d recommend using that.

          Beyond Bleeding Edge contains the newest software from the main branch, including ongoing development changes, which may introduce some bugs.

          Thanks,

          Komal

          in reply to: Facility ports #8281
          Komal Thareja
          Participant

            Hi Mike,

            I just tried the example notebook you mentioned, and it worked for me using the Beyond Bleeding Edge container. Could you please share which JH container you are using? Also, if possible, please share the Slice ID where you see this error, or the Facility Ports with which you ran into it.

            Thanks,

            Komal

            in reply to: Cannot login to SALT node from my local machine #8270
            Komal Thareja
            Participant

              Hi Sourya,

              There was a power outage at SALT, and the site is being recovered. We will let you know as soon as the recovery is complete.

              Thanks,

              Komal

              in reply to: Hardware Steering – Connectx6 #8269
              Komal Thareja
              Participant

                Hi Tanay,

                We are targeting a release for either Summer or Fall and will share more details once our plans are finalized.

                Thanks,

                Komal

                in reply to: Node is closed, cannot renew #8264
                Komal Thareja
                Participant

                  Hi Sankalpa,

                  Based on our logs, this slice has been renewed three times in descending order of time. During the renewal attempt on 2025-02-15 13:46:54,320, Client3 (be97d870-3299-418e-ba17-a1ddcab06bdb) could not be renewed because a required component was likely allocated to another future slice. Since requesting slices in the future is enabled, this allocation prevented the renewal of that particular VM. However, other available resources were successfully extended.

                  The lease for Client3 ended on 2025-02-19 05:05:37 UTC, after which the VM was closed/deleted. The latest renewal request was issued on 2025-02-19 22:04:39, which was after the lease expiration, making the VM ineligible for renewal.

                  Logs:

                  Reservation Details:

                  • Reservation ID: be97d870-3299-418e-ba17-a1ddcab06bdb
                  • Slice ID: 6acbc4aa-4b6e-44e3-b7c0-8c2f33de46c4
                  • Resource Type: VM
                  • Status: Closed (Last update: Insufficient resources – Renew failed: Component of type ConnectX-6 with PCI Address 0000:a1:04.7 is already allocated to another reservation on node GDXYNF3).

                  Start: 2025-01-24 17:18:11 UTC
                  End: 2025-02-19 05:05:37 UTC
                  Requested End: 2025-02-28 13:46:54 UTC
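
                  The timing argument above can be checked with a few lines of Python. This is a sketch only: it assumes the renewal request timestamp is also in UTC (the logs above do not say), and `can_renew` is a hypothetical helper that encodes just the rule described in this reply, not the orchestrator's actual logic.

```python
from datetime import datetime, timezone

FMT = "%Y-%m-%d %H:%M:%S"


def parse_utc(stamp):
    """Parse a 'YYYY-MM-DD HH:MM:SS' timestamp and mark it as UTC."""
    return datetime.strptime(stamp, FMT).replace(tzinfo=timezone.utc)


def can_renew(lease_end, renew_requested_at):
    """Simplified rule: a renewal must be requested before the lease ends."""
    return renew_requested_at < lease_end


lease_end = parse_utc("2025-02-19 05:05:37")        # End of the Client3 lease
renew_request = parse_utc("2025-02-19 22:04:39")    # Latest renewal request (UTC assumed)

print(can_renew(lease_end, renew_request))   # False
print(renew_request - lease_end)             # 16:59:02
```

                  Under that assumption, the latest renewal request arrived roughly 17 hours after the lease had already ended.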

                  Let me know if you need further clarification.

                  Best,

                  Komal

                  in reply to: Half Slice Unreachable #8261
                  Komal Thareja
                  Participant

                    Hi Luca,

                    I reviewed your slice and noticed that during the last renewal, two of the VMs could not be renewed due to insufficient resources. As a result, they were not extended and have now transitioned to a Closed state, meaning they have been deleted. Below is a snapshot for reference.

                    Please note that since users can request slices for future use, renewing an active slice may fail if the resources it needs are already reserved for future slices.

                    You can check the current state of your slice using the following code:

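                    # Import the fablib manager and create a client using your current configuration
                    from fabrictestbed_extensions.fablib.fablib import FablibManager as fablib_manager
                    fablib = fablib_manager()

                    # Look up the slice by name and list its nodes
                    slice_name = "Slice INT slice - DALL + LOSA"
                    slice = fablib.get_slice(slice_name)
                    slice.list_nodes()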
                    

                    Additionally, if the renewal was triggered via JupyterHub (JH), this information has also been communicated to you there.
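
                    To pick out only the nodes that are no longer usable, a small filter along the following lines may help. This is a sketch under assumptions: `closed_nodes` is a hypothetical helper, it assumes each node object exposes `get_name()` and `get_reservation_state()` as fablib nodes are expected to, and the state names in `closed_states` are illustrative rather than an exhaustive list.

```python
def closed_nodes(nodes, closed_states=("Closed", "CloseWait", "Failed")):
    """Return (name, state) pairs for nodes whose reservation state is in `closed_states`.

    `nodes` can be any iterable of objects providing get_name() and
    get_reservation_state(); the state names are assumptions.
    """
    result = []
    for node in nodes:
        state = str(node.get_reservation_state())
        if state in closed_states:
            result.append((node.get_name(), state))
    return result


# With fablib, assuming `slice` was obtained via fablib.get_slice(...):
#   for name, state in closed_nodes(slice.get_nodes()):
#       print(f"{name}: {state}")
```

                    Nodes reported by this filter would need to be recreated, since a closed reservation cannot be renewed.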


                    Reservation ID: b0ff1824-011d-4225-a748-371ddf6eb5e4
                    Slice ID: 831a0115-8e9e-4854-bbcb-d12022a878aa
                    Resource Type: VM
                    Notices: Reservation b0ff1824-011d-4225-a748-371ddf6eb5e4 (Slice INT slice - DALL + LOSA(831a0115-8e9e-4854-bbcb-d12022a878aa) Graph Id:f6a2c692-b430-4a01-95d6-2f2343320dea Owner:s317694@studenti.polito.it) is in state (Closed,None_) (Last ticket update: Insufficient resources : ['ram', 'disk'])

                    Reservation ID: dc4456ff-f768-4915-8c7e-97696b2fcc21
                    Slice ID: 831a0115-8e9e-4854-bbcb-d12022a878aa
                    Resource Type: VM
                    Notices: Reservation dc4456ff-f768-4915-8c7e-97696b2fcc21 (Slice INT slice - DALL + LOSA(831a0115-8e9e-4854-bbcb-d12022a878aa) Graph Id:f6a2c692-b430-4a01-95d6-2f2343320dea Owner:s317694@studenti.polito.it) is in state (Closed,None_) (Last ticket update: Insufficient resources : ['ram', 'disk'])

                    Thanks,

                    Komal

                    in reply to: Iperf3 error on FABnet – No path to nodes #8255
                    Komal Thareja
                    Participant

                      Hi,

                      I was able to run the notebook.

                      Could you please share your Slice ID?

                      Additionally, could you post your inquiries in the FABRIC General Questions and Discussion forum?

                      Thanks,

                      Komal

                      in reply to: hawi site ssh problem. #8240
                      Komal Thareja
                      Participant

                        Hi Yuanjun,

                        Could you please try to SSH into your VMs now? We were able to SSH to them successfully.

                        Additionally, could you post your inquiry in the FABRIC General Questions and Discussion forum?

                        Thanks,

                        Komal

                        in reply to: STAR SITE PROBLEM #8239
                        Komal Thareja
                        Participant

                          Hi Yuanjun,

                          Unfortunately, the STAR resources you’re requesting are currently in use. Please try again later or consider scheduling your slices in advance using the notebook.

                          Additionally, could you post your inquiry in the FABRIC General Questions and Discussion forum?

                          Thanks,
                          Komal

                          in reply to: GPU node is not available on MAX site. #8238
                          Komal Thareja
                          Participant

                            Hi Yuanjun,

                            Unfortunately, the MAX resources you’re requesting are currently in use. Please try again later or consider scheduling your slices in advance using the notebook.

                            Additionally, could you post your inquiry in the FABRIC General Questions and Discussion forum?

                            Thanks,
                            Komal

                            in reply to: Slice submit via Jupyter get’s stuck #8211
                            Komal Thareja
                            Participant

                              Glad to hear that worked! We will work to address this and add support for interrupting the request or returning a meaningful error in such cases.

                              Thanks,

                              Komal

                              in reply to: Unable to reserve slice #8208
                              Komal Thareja
                              Participant

                                Hi Kriti,

                                Could you please re-run this notebook: jupyter-examples-rel1.8.1/configure_and_validate/configure_and_validate.ipynb?
                                This should renew any expired keys. Please try your slice again after this. I want to rule out any SSH errors. If you continue to see the error, please share /tmp/fablib/fablib.log.

                                Thanks,
                                Komal

                                in reply to: Slice submit via Jupyter get’s stuck #8207
                                Komal Thareja
                                Participant

                                  An "Authentication failed" error would explain the SSH errors you are observing. Could you please re-run this notebook: jupyter-examples-rel1.8.1/configure_and_validate/configure_and_validate.ipynb?
                                  This should renew any expired keys. Please try your slice again after this.

                                  Thanks,
                                  Komal
