
Tools


Last updated 2 years ago

Slack

Online chat open during and after the conference. Each participant should receive a private sign-in for the #ai-workshop channel.

Python

Programming language that we will use throughout the course. It is a general-purpose language that supports "object-oriented" programming.

Pandas

Python package to work with tabular data (e.g. .xlsx, .csv ...). It is loaded into a Google Colab or a Jupyter Notebook with the following command (pd is the common pandas abbreviation):

import pandas as pd
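As a minimal sketch of what working with tabular data looks like once pandas is loaded, the example below reads a small CSV table and filters its rows. The table content and the column names (smiles, name, activity) are made up for illustration; in practice you would pass the path of your own .csv file to pd.read_csv.

```python
import io

import pandas as pd

# Hypothetical CSV content, standing in for a real .csv file on disk.
csv_text = """smiles,name,activity
CCO,ethanol,0
CC(=O)O,acetic acid,0
c1ccccc1,benzene,1
"""

# pd.read_csv accepts a file path or any file-like object.
df = pd.read_csv(io.StringIO(csv_text))

# Basic inspection: table dimensions and column names.
n_rows, n_cols = df.shape
columns = list(df.columns)

# Keep only the rows where the (hypothetical) activity column equals 1.
actives = df[df["activity"] == 1]

print(df.head())
print(n_rows, n_cols, columns)
```

Excel files can be read in a similar way with pd.read_excel, which may require an additional package such as openpyxl.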

Matplotlib

Python package for creating all kinds of plots. It is loaded into a Google Colab or a Jupyter Notebook with the following command (plt is the common matplotlib abbreviation):

import matplotlib.pyplot as plt
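A short sketch of a typical plot, assuming two lists of numbers of equal length. The values and axis labels below are placeholders chosen for illustration only.

```python
import matplotlib.pyplot as plt

# Hypothetical values, used only to have something to plot.
mol_weight = [46.1, 60.1, 78.1, 180.2]
logp = [-0.1, 0.1, 1.7, 1.3]

# Create a figure with a single set of axes and draw a scatter plot on it.
fig, ax = plt.subplots(figsize=(4, 4))
ax.scatter(mol_weight, logp)

# Label the axes so the plot can be read on its own.
ax.set_xlabel("Molecular weight")
ax.set_ylabel("LogP")
ax.set_title("Example scatter plot")

plt.show()
```

In a notebook the figure is displayed directly below the cell.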

RDKit

The largest cheminformatics Python package. Among many other functions, it allows the user to standardise molecules, draw chemical structures and create fingerprints for molecular representation. It is loaded into a Google Colab or a Jupyter Notebook with the following command:

import rdkit

RDKit is a very large package, so we usually import only the specific modules we need:

from rdkit import Chem
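The following sketch shows one way the Chem module might be used to read a molecule and create a fingerprint. It assumes a recent RDKit version that provides the rdFingerprintGenerator module; the SMILES string, the radius and the fingerprint size are arbitrary choices for illustration.

```python
from rdkit import Chem
from rdkit.Chem import rdFingerprintGenerator

# Example molecule (ethanol) written as a SMILES string.
smiles = "CCO"

# Parse the SMILES into an RDKit molecule; this returns None if parsing fails.
mol = Chem.MolFromSmiles(smiles)

# Canonical SMILES and number of heavy (non-hydrogen) atoms.
canonical = Chem.MolToSmiles(mol)
n_atoms = mol.GetNumAtoms()

# Morgan (circular) fingerprint as a fixed-length bit vector.
generator = rdFingerprintGenerator.GetMorganGenerator(radius=2, fpSize=2048)
fp = generator.GetFingerprint(mol)

print(canonical, n_atoms, fp.GetNumBits())
```

Checking that the molecule is not None before using it is a good habit when reading SMILES from a real dataset, since some entries may fail to parse.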

Umap-learn

Umap-learn is a Python package to perform dimension reduction with Uniform Manifold Approximation and Projection (UMAP). It is very convenient for visualising, in our case, a dataset of chemical entities as a single 2D scatter plot, where each dot represents a molecule. It is loaded into a Google Colab or a Jupyter Notebook with the following command:

import umap

Scikit-learn

Scikit-learn (sklearn) is a Python package containing several algorithms to perform supervised and unsupervised machine learning. It is loaded into a Google Colab or a Jupyter Notebook with the following command:

import sklearn

While pandas and matplotlib come preinstalled in Google Colab, other packages such as RDKit and UMAP may need to be installed using the !pip install command (the PyPI package names are rdkit, umap-learn and scikit-learn).

Google Colaboratory

Google Colaboratory (Colab) is a hosted Jupyter notebook service that allows users to write and execute Python code for free in the Google cloud. It runs fully in the browser (for example, Chrome) and does not require additional installations, aside from a Google account and an internet connection. If you need, you can also run Colab on your local hardware.

Colab is not directly connected to the user's Google Drive, but this can easily be achieved by running the following commands:

from google.colab import drive
drive.mount('/content/drive')

If you close Colab, or disconnect the runtime, any installed package will disappear, so you will need to run all the relevant cells again.

Ersilia Model Hub

The Ersilia Model Hub is a platform of open source pretrained AI/ML models for drug discovery, developed and maintained by the Ersilia Open Source Initiative. It is licensed under a GPLv3 open source license.

List of available models

Open source code

Documentation

If you use the Ersilia Model Hub in your research, please cite us.

Command Line Interface

Text-based user interface used to run programs, manage computer files and interact with the computer. The default CLI in UNIX-like systems (Linux and macOS) is called Terminal; in Windows it is the Command Prompt or Windows PowerShell.

Git

Free and open source software for distributed version control. It allows users to track changes in any set of files, speeding up collaborative work. It needs to be installed in your local system.

GitHub

Internet-hosting service for software development and version control based on Git. It is a platform where users can collaborate and contribute to open source projects.