1. Operations on Slices taking time

Operations on Slices taking time

Home Forums FABRIC General Questions and Discussion Operations on Slices taking time

Viewing 7 posts - 1 through 7 (of 7 total)
  • Author
    Posts
  • #9802
    Nirmala Shenoy
    Participant

      Hello

      I am working onĀ  site “TACC” and any commands I give to the remote nodes are taking time /just hanging there. I worked on this slice yesterday and did not have problems.

      I am in the process of reserving resources in “HAWI”. The reservation has now been going for more than 1 hour – it is stuck at running post_boot_config.

      thanks

      Nirmala

      #9803
      Komal Thareja
      Moderator

        Hi Nirmala,

        Could you please check if you see any errors in /tmp/fablib/fablib.log ?

        Another thing for the Post Boot config delays could be expired bastion keys. Please run the notebook jupyter-examples-*/configure_and_validate/configure_and_validate.ipynb This shall renew your bastion keys if they are expired.

        Please remember to update the Project ID in this notebook.

        Please let me know in case you continue to run into issues.

        Best,

        Komal

        #9804
        Nirmala Shenoy
        Participant

          the bastion key has not expired Komal. I am able to ssh into some nodes in the TACC site, but looks like I lost access to some nodes.

          I am not seeing a tmp/fablib directory

          thanks

           

          #9805
          Komal Thareja
          Moderator

            Hi Nirmala,

            We’re currently investigating what appears to be an unplanned outage on our Kafka service, which may be causing the slowness you’re experiencing. We are actively working on recovery, and I’ll keep you updated on our progress.

            Best, Komal

            #9807
            Nirmala Shenoy
            Participant

              thanks Komal

              #9810
              Komal Thareja
              Moderator

                Thank you for reporting this Nirmala! System has been recovered. Please let us know if you continue to run into issues.

                Best,

                Komal

                #9814
                Nirmala Shenoy
                Participant

                  thanks Komal. I will be testing them tomorrow.

                Viewing 7 posts - 1 through 7 (of 7 total)
                • You must be logged in to reply to this topic.