1. Komal Thareja

Komal Thareja

Forum Replies Created

Viewing 15 posts - 31 through 45 (of 416 total)
  • Author
    Posts
  • in reply to: Slice submit via Jupyter get’s stuck #8197
    Komal Thareja
    Participant

      Hi,

      Both your slices  are in Stable State. It seems like a bug in fablib or a race condition which is causing fablib to think the slice is still Configuring.

      As a workaround, could you please do the following?

      I am trying to reproduce this at my end and would work to fix this. Apologies for the inconvenience!


      slice=fablib.get_slice(slice_name)
      slice.post_boot_config()
      slice.list_nodes();
      slice.list_interfaces();


      Slice Name: FRR-losa Slice ID: 0367f6f3-1331-49dc-9399-722616237a5b Project ID: a57c7715-d871-4369-82e6-408c9a57a6e7 Project Name: UCSD-FABRIC test
      Graph ID: 071abcd4-f292-449d-a69a-da4768780546
      Slice owner: { name: orchestrator, guid: orchestrator-guid, oidc_sub_claim: 91f5ecc3-16ff-4f09-95ac-dfeee0c3b1e3, email: jbalcas@es.net}
      Slice state: StableOK
      Lease time: 2025-02-07 14:24:38+00:00

      Thanks,

      Komal

      in reply to: Site to Site Connection Issue #8192
      Komal Thareja
      Participant

        Hi Raghav,

        Please set up your JH environment by running the notebook: jupyter-examples-rel1.8.1/configure_and_validate/configure_and_validate.ipynb

        This shall setup all the required configuration files and SSH keys. Please try the Wide Area Link notebook or Hello Fabric after that to ensure your configuration works. Please let us know if you run into issues.

        Thanks,

        Komal

        in reply to: Error message: strptime() argument 1 must be str, not None #8188
        Komal Thareja
        Participant

          Issue resolved over zoom meeting, the issue was the token.

          Token file only had id_token instead of the entire token contents. Downloading the token file and using that resolved the issue.

          Please let us know if you run into any other issues. I have taken a note to return a more user friendly error. We will address this in the next release.

          Thanks,

          Komal

          in reply to: Error message: strptime() argument 1 must be str, not None #8183
          Komal Thareja
          Participant

            Reached out to Vaneshi via email to request a meeting to work to resolve this!

            Thanks,

            Komal

            in reply to: Site to Site Connection Issue #8182
            Komal Thareja
            Participant

              Hi Raghav,

              The data plane interfaces on your VMs connected via L2STS do not have IP addresses configured.

              The enp3s0 interface on your VMs is designated as the management interface and should be used solely for SSH access. For your experiment, please use the data plane interfaces, which are enp7s0 on both VMs.

              I recommend exploring the JH example—Wide Area Link (Layer 2)—using manual, auto, or user-defined configurations, as it demonstrates how IP addresses should be set up.  Please, let us know if you encounter any further issues.

              Snapshot from the VMs:


              root@4f3a79fa-6e29-454e-9ec4-d1bfbda81a17-bapi-v2:~# ifconfig -a
              enp1s0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 9000
              inet 10.30.6.167 netmask 255.255.254.0 broadcast 10.30.7.255
              inet6 fe80::f816:3eff:fe82:7b9 prefixlen 64 scopeid 0x20
              inet6 2001:400:a100:3070:f816:3eff:fe82:7b9 prefixlen 64 scopeid 0x0
              ether fa:16:3e:82:07:b9 txqueuelen 1000 (Ethernet)
              RX packets 51778 bytes 150077282 (150.0 MB)
              RX errors 0 dropped 0 overruns 0 frame 0
              TX packets 25537 bytes 2608566 (2.6 MB)
              TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0


              enp6s0: flags=4098<BROADCAST,MULTICAST> mtu 1500
              ether 06:b7:27:d2:b5:0b txqueuelen 1000 (Ethernet)
              RX packets 0 bytes 0 (0.0 B)
              RX errors 0 dropped 0 overruns 0 frame 0
              TX packets 0 bytes 0 (0.0 B)
              TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0


              lo: flags=73<UP,LOOPBACK,RUNNING> mtu 65536
              inet 127.0.0.1 netmask 255.0.0.0
              inet6 ::1 prefixlen 128 scopeid 0x10
              loop txqueuelen 1000 (Local Loopback)
              RX packets 178 bytes 23663 (23.6 KB)
              RX errors 0 dropped 0 overruns 0 frame 0
              TX packets 178 bytes 23663 (23.6 KB)
              TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0


              root@bd65ee61-46a2-4cb2-b89e-c6b385052336-bapi-vm1:~# ifconfig -a
              enp3s0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 9000
              inet 10.20.5.38 netmask 255.255.254.0 broadcast 10.20.5.255
              inet6 fe80::f816:3eff:fe55:c84f prefixlen 64 scopeid 0x20
              ether fa:16:3e:55:c8:4f txqueuelen 1000 (Ethernet)
              RX packets 15231 bytes 146475806 (146.4 MB)
              RX errors 0 dropped 0 overruns 0 frame 0
              TX packets 13258 bytes 1020159 (1.0 MB)
              TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0


              enp7s0: flags=4098<BROADCAST,MULTICAST> mtu 1500
              ether 16:8a:89:5e:75:97 txqueuelen 1000 (Ethernet)
              RX packets 0 bytes 0 (0.0 B)
              RX errors 0 dropped 0 overruns 0 frame 0
              TX packets 0 bytes 0 (0.0 B)
              TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0


              lo: flags=73<UP,LOOPBACK,RUNNING> mtu 65536
              inet 127.0.0.1 netmask 255.0.0.0
              inet6 ::1 prefixlen 128 scopeid 0x10
              loop txqueuelen 1000 (Local Loopback)
              RX packets 238 bytes 37767 (37.7 KB)
              RX errors 0 dropped 0 overruns 0 frame 0
              TX packets 238 bytes 37767 (37.7 KB)
              TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0

              Thanks,

              Komal

              in reply to: Unable to reserve slice #8166
              Komal Thareja
              Participant

                Hi Kriti,

                I think the attachment got lost. Could you please email it to me directly at kthare10@renci.org?

                Thanks,

                Komal

                in reply to: Unable to reserve slice #8162
                Komal Thareja
                Participant

                  Hi Kriti,

                  I can verify that your slice is StableOK. Please check the portal to verify that.

                  This looks like a bug, where fablib may be reflecting the stale state. If possible, could you please share your notebook, so I can try to reproduce this on my end and fix the bug? Appreciate your help with this.


                  Slice Name: Eibp_large_PRIN Slice ID: 02ff4c5b-140e-4e2a-ab2a-02ca8bd45ca3 Project ID: 787adfc9-d37e-42f2-8efe-8e32793e0bb8 Project Name: Expedited Internet Bypass Protocol
                  Graph ID: 53e16605-016c-4e2a-8754-336effb71188
                  Slice owner: { name: orchestrator, guid: orchestrator-guid, oidc_sub_claim: 5cf08403-b13e-458a-a17d-ee69abdacffa}
                  Slice state: StableOK
                  Lease time: 2025-02-05 18:27:56+00:00

                  Thanks,

                  Komal

                  in reply to: Unable to reserve slice #8160
                  Komal Thareja
                  Participant

                    Hi Kriti,

                    Could you please share your slice id?

                    Thanks,

                    Komal

                    in reply to: Error message: strptime() argument 1 must be str, not None #8157
                    Komal Thareja
                    Participant

                      Hi Vaneshi,

                      I am unable to reproduce this with any of the JH containers. I do notice a small error in the API posted above. quiet parameter is a boolean.

                      Could you please check the following?

                      • You have a valid token in ~/.tokens.json
                      • Try the snippet below


                      from fabrictestbed_extensions.fablib.fablib import FablibManager as fablib_manager
                      fablib = fablib_manager()
                      cx5_column_name = 'nic_connectx_5_available'
                      cx6_column_name = 'nic_connectx_6_available'
                      sites_connectx_json = fablib.list_sites(
                      output="json",
                      quiet=True,
                      filter_function=lambda x: x[cx6_column_name] > 0 or x[cx5_column_name] > 0,
                      latlon=False,
                      )
                      print(sites_connectx_json)

                      Please let me know if you still run into errors.

                      Thanks,

                      Komal

                      in reply to: Site to Site Connection Issue #8146
                      Komal Thareja
                      Participant

                        Hi Raghav,

                        Could you please check if the interfaces on the VMs have the IP addresses configured?

                        Also, please share the Slice ID for your slice. This will help us take a look at it as well.

                        Thanks,

                        Komal

                        in reply to: Network reconfig after numa_tune #8142
                        Komal Thareja
                        Participant

                          Hi Ilya,

                          Thanks for the kind words—we appreciate the feedback!

                          To ensure the network interfaces retain their configuration after rebooting, please use the following code to reconfigure all nodes in the slice:

                          for n in slice.get_nodes():  
                              n.config()
                          

                          This will restore the network configurations.

                          Regarding DPUs, we’re currently exploring BlueField 2 DPUs and targeting summer for initial support, with more details coming soon. Stay tuned for updates!

                          Please let us know if you run into any other issues.

                          Best Regards,

                          Komal

                          in reply to: Cannot view slice topology through My Slices list #8135
                          Komal Thareja
                          Participant

                            Hi Sourya,

                            It appears that your slice utilizes a Port Mirror service, which may not yet be supported by the Slice Viewer. We will check with Yaxue, who works on the Portal, to confirm this. We are working on adding support for this feature in the next release. Apologies for any inconvenience.

                            Thanks,

                            Komal

                            in reply to: Hardware Steering – Connectx6 #8134
                            Komal Thareja
                            Participant

                              Hi Tanay,

                              Based on the details of the ConnectX-6 from one of the FABRIC VMs and the DPDK documentation you shared (DPDK MLX5 Guide), it appears that the ConnectX-6 available in FABRIC is not supported. We are currently working on integrating new BlueField DPUs, which may provide a suitable solution.


                              [root@Node2 ~]# lspci | grep X-6
                              06:00.0 Ethernet controller: Mellanox Technologies MT28908 Family [ConnectX-6]
                              07:00.0 Ethernet controller: Mellanox Technologies MT28908 Family [ConnectX-6]
                              [root@Node2 ~]# mstconfig -d 06:00.0 query
                              Device #1:
                              ----------
                              Device type: ConnectX6
                              Name: MCX653106A-ECA_Ax
                              Description: ConnectX-6 VPI adapter card; H100Gb/s (HDR100; EDR IB and 100GbE); dual-port QSFP56; PCIe3.0 x16; tall bracket; ROHS R6
                              Device: 06:00.0

                              Thanks,

                              Komal

                              in reply to: Site to Site Connection Issue #8125
                              Komal Thareja
                              Participant

                                Hi Rajiv,

                                L2STS links should work with SharedNICs. Could you please share your slice details where this is not working?

                                Thanks,

                                Komal

                                in reply to: Permission denied for in-slice port mirroring #8113
                                Komal Thareja
                                Participant

                                  I also just noticed the attached screenshot. The Permission error is related to SmartNIC usage. To use SmartNICs for your project, you still need the Component.SmartNIC permission.

                                  Thanks,
                                  Komal

                                Viewing 15 posts - 31 through 45 (of 416 total)