Forum Replies Created
-
AuthorPosts
-
Could you please check
/tmp/fablib/fablib.log
for any errors? Also could you please check if you are able to SSH to the VMs in your slice from command line?Also, please share your notebook if possible to help us recreate this in our environment.
Thanks,
Komal
-
This reply was modified 1 year, 1 month ago by
Komal Thareja.
This typically happens, when your SSH keys are not setup properly. By default, fablib executes some default post boot config tasks to configure IP addresses if the notebook setup for
auto
config is used.Could you please check if your bastion keys are not expired and sliver keys are setup correctly?
Validation of keys and configuration can also be done via
configure_and_validate.ipynb
notebook available underjupyter-examples-*
directory. This notebook was added recently and should be available underjupyter-examples-main
orjupyter-examples-1.6.1
Please let us know if the issue persists.
Thanks,
Komal
-
This reply was modified 1 year, 1 month ago by
Komal Thareja.
@Acheme – Thank you for providing the solution for this issue. Appreciate it!
I also wanted to add that Apache Web Server notebook was a bit outdated and not inline with latest Fablib API. I have updated the notebook so that it uses latest Fablib APIs which take care of the networking as well and does not rely on user explicitly configuring IP addresses.
If needed, please try the latest version of the notebook from Github: https://github.com/fabric-testbed/jupyter-examples/blob/main/fabric_examples/complex_recipes/apache_web_server/apache_web_server.ipynb
Thanks,
Komal
-
This reply was modified 1 year, 1 month ago by
Komal Thareja.
Should be resolved now. I just tried a slice and it worked.
Thanks,
Komal
Hi Acheme,
We are noticing intermittent connectivity issues with TACC switch, resulting in the Network Services failing to provision. That’s resulting the Network being set to None. We are working to resolve this issue and will keep you posted.
Thanks,
Komal
Hi Acheme,
Thank you for reporting this issue. TACC worker
tacc-w1
where VMs were spawned had issues. We have recovered the worker and provisioning should work now.Thanks,
Komal
January 23, 2024 at 10:14 am in reply to: exception error when reserving resources for a slice #6393Yes either of the following should work.
Spring 2024 container –
fabrictestbed-extensions==1.6.0
orFall 2023 container –
fabrictestbed-extensions==1.5.6
Thanks,
Komal
Hi Nirmala,
Which JH container are you using? This error typically happens due to a version mismatch for Fablib. Please see below:
Fablib – if you are still using Release 1.5, please update fablib using the command:
pip install fabrictestbed-extensions==1.5.6
Also, if you face issues running notebooks on JH, please ensure there are no entries for
fabrictestbed-extensions
infabric_config/requirements.txt
. If you have any entries there, Please remove them and restart your JH container.Thanks,
Komal
January 22, 2024 at 9:04 pm in reply to: Maintenance on Network AM – 01/22/2024 (3:00pm-4:00pm EST) #6384Network model has been updated and maintenance is complete!
@Robin, could you please try to restart your JH container via File -> Hub Control Panel -> Stop My container -> Start My container when you see this error?
Also, the path to replace the token is
/home/fabric/.tokens.json
. We will fix this in the documentation of it’s incorrect. Also, could you please let us know which JH container are you using?Thanks,
Komal
@Lyod – Fabnetv4Ext notebook has a bug and configures the route incorrectly. We will fix the notebook, sharing the fix needed to the routes below. Hope this helps!
Configure Node1 cell in notebook should change the route as below via EXT gateway, ping should work.
stdout, stderr = node1.execute(f'sudo ip route add 0.0.0.0/0 via {network1.get_gateway()}')
Configure Node2 cell in notebook should change the route as below via EXT gateway, ping should work.
stdout, stderr = node2.execute(f'sudo ip route add 0.0.0.0/0 via {network2.get_gateway()}')
Thanks,
Komal
-
This reply was modified 1 year, 1 month ago by
Komal Thareja.
Hi Shams,
Could you please remove <> enclosing the project id in
/home/fabric/work/fabric_config/fabric_rc
and restart your JH container via File -> Hub Control Panel -> Stop Container followed by Start Container?Please try your notebook again and let us know if you still observe this error.
Thanks,
Komal
STAR site has 6 worker nodes each with 128 cores = 768 cores. This is same as the previous release.
Oversubscription is not enabled on STAR.
January 8, 2024 at 3:09 pm in reply to: How to access the files from my older username on the same project? #6261@Nagmat – Your back should be available in your new JH container as
fabric_bkp.tgz
. Please start your container and let us know if you face any issues accessing the data.Thanks,
Komal
January 8, 2024 at 2:56 pm in reply to: How to access the files from my older username on the same project? #6259@Nagmat – Could you please stop your JH container? I took backup of your old files and would copy it your new container.
-
This reply was modified 1 year, 1 month ago by
-
AuthorPosts