
ScienceIT will be hosting HPC workshops on January 31 and February 7. In partnership with UC Berkeley D-Lab, IT is also offering new training courses to Lab employees from January to March 2023.
HPC 101 Workshops with ScienceIT
The IT Division Scientific Computing Group (ScienceIT) will be hosting high-performance computing (HPC) virtual training sessions on January 31 and February 7. This is a great opportunity for new and prospective users to become acquainted with the Lawrencium supercluster and related tools.
Prerequisites:
- Registration is required for each session. Zoom links will be provided upon registration.
- Training is open to all Lab researchers and their collaborators.
- A Lawrencium User Account is preferred but not required.
- Join with your computer or laptop to participate in hands-on practice.
Please contact HPCS User Services at hpcshelp@lbl.gov if you have any questions.
Introducing High-Performance Computing (HPC) on Lawrencium
January 31, 2023, 3:00pm – 4:30pm
The first training session will provide a hands-on overview of the Lawrencium supercluster. The training will include the following topics:
- Overview of Lawrencium supercluster
- Getting Access and login to cluster
- Software access and installation
- Job submission and monitoring
- Data transfer to/from clusters
- Brief overview of Open on Demand
Overview of Open OnDemand and MyLRC portal
February 7, 2023, 3:00pm– 5:00 pm
The second training session will focus on introducing Open OnDemand web services on Lawrencium and discuss the practical aspects of MyLRC portal. The training will include the following topics:
Part I : OOD Applications
- Command-line shell access
- File management
- Interactive server and GUI applications, such Jupyter Notebook, Matlab and RStudio
- Full linux desktop streaming via web for GUI heavy jobs such as VMD, ParaView
- Customize Jupyter kernels
- Job management and monitoring
Part II : Using MyLRC portal
- Getting a user account on LRC
- Getting access to project
- Account deletion
- Requesting project account and management
IT Training with UC Berkeley D-Lab
Click on the course titles below to learn more. Register for courses through the LBNL portal.
Python Fundamentals: Parts 1-4
January 23, 2023, 10:00 pm to February 1, 2023, 1:00pm
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience.
Python Data Wrangling and Manipulation with Pandas
January 23, 2023, 2:00pm to 5:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with ‘relational’ or ‘labeled’ data both easy and intuitive.
R Fundamentals: Parts 1-4
January 24, 2023, 2:00pm to February 2, 2023, 5:00pm
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents.
Python Data Visualization
January 25, 2023, 2:00pm to 5:00pm
For this workshop, we’ll provide an introduction to visualization with Python. We’ll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.
Institutional Review Board (IRB) Fundamentals
February 7, 2023, 10:00am to 1:00pm
Are you starting a research project at UC Berkeley that involves human subjects? If so, one of the first steps you will need to take is getting IRB approval.
Python Machine Learning Fundamentals: Parts 1-2
February 7, 2023, 2:00pm to February 9, 2023, 5:00pm
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets.
Bash + Git: Introduction
February 8, 2023, 2:00pm to 5:00pm
This workshop will start by introducing you to navigating your computer’s file system and basic Bash commands to remove the fear of working with the command line and to give you the confidence to use it to increase your productivity.
R Data Wrangling and Manipulation: Parts 1-2
February 9, 2023, 10:00am to February 14, 2023, 1:00pm
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier.
Python Data Wrangling and Manipulation with Pandas
February 14, 2023, 10:00am to 1:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with ‘relational’ or ‘labeled’ data both easy and intuitive. It enables doing practical, real world data analysis in Python.
Excel Data Analysis: Introduction
February 14, 2023, 1:00pm to 4:00pm
This is a three-hour introductory workshop that will provide an overview of Excel, with no prior experience assumed. Attendees will learn how to use functions for handling data and making calculations, how to build charts and pivot tables, and more.
R Geospatial Fundamentals: Vector Data, Parts 1-2
February 16, 2023, 10:00am to February 21, 2023, 1:00pm
Geospatial data are an important component of data visualization and analysis in the social sciences, humanities, and elsewhere. The R programming language is a great platform for exploring these data and integrating them into your research.
Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP
February 16, 2023, 1:00pm to 4:00pm
This three-hour workshop will cover charts in more detail, review pivot tables, and the widely-used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.
R Machine Learning with tidymodels: Parts 1-2
February 22, 2023, 1:00pm to March 1, 2023, 4:00pm
Machine learning often evokes images of Skynet, self-driving cars, and computerized homes. However, these ideas are less science fiction as they are tangible phenomena that are predicated on description, classification, prediction, and pattern recognition in data.
R Geospatial Fundamentals: Raster Data
February 23, 2023, 10:00am to 1:00pm
Geospatial data are an important component of data visualization and analysis in the social sciences, humanities, and elsewhere. The R programming language is a great platform for exploring these data and integrating them into your research.
Python Text Analysis Fundamentals: Parts 1-2
March 8, 2023, 2:00pm to March 15, 2023, 5:00pm
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.
Python Data Visualization
March 13, 2023, 2:00pm to 5:00pm
For this workshop, we’ll provide an introduction to visualization with Python. We’ll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.
Finding Health Statistics and Data
March 15, 2023, 12:00pm to 1:30pm
Participants in this workshop will learn about some of the issues surrounding the collection of health statistics, and will also learn about authoritative sources of health statistics and data.
Support
- Berkeley Lab staff may contact ittraining@lbl.gov for training feedback.
- All staff should contact the D-Lab Front Desk for trouble registering for courses, Zoom links, etc. at dlab-frontdesk@berkeley.edu.
- Visit it.lbl.gov/training for more courses offered by IT.