Contributing
============
This document aims to lay out the basics of contributing code to the ``mdevaluate`` package.
The code is managed through a git repository, hence this guide gives basic information on the usage of `git <https://git-scm.com>`_.
In this document the prefix ``$`` indicates commands which should be run in a shell.
For a brief 15 min interactive tutorial visit `try.github.org <https://try.github.org>`_.
Let's start with a short introduction to the terminology.
Python code is organized in *packages* and *modules*:
Modules:
Any Python file (e.g. ``test.py``) is called a module. A module can be imported (``import test``) and then used
in other Python code if it is on the Python path, for example in the working directory.
In principle, importing a module means executing the code inside the file.
All definitions, like variables or functions, are then available under the module's name.
Packages:
Python modules can be grouped into packages. A Python package is basically a folder
that contains at least one mandatory file, ``__init__.py``. This file is the entry
point into the package: its code is executed when the package is imported.
All modules in the folder are treated as submodules, which can be accessed via
a dot syntax, e.g. ``import package.test``. Packages can also contain subpackages.
A more `detailed explanation <https://docs.python.org/3/tutorial/modules.html>`_ can be found in the official python documentation.
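The module and package rules above can be tried out with a throwaway package.
The following is a minimal sketch: the names ``demo_package`` and ``double`` are made up for illustration and are not part of ``mdevaluate``.

.. code-block:: python

   import importlib
   import sys
   import tempfile
   from pathlib import Path

   # Build a throwaway package on disk; all names here are hypothetical.
   root = Path(tempfile.mkdtemp())
   package_dir = root / "demo_package"
   package_dir.mkdir()

   # __init__.py is executed when the package itself is imported.
   (package_dir / "__init__.py").write_text("greeting = 'hello from the package'\n")

   # Any other Python file in the folder is a submodule of the package.
   (package_dir / "test.py").write_text("def double(x):\n    return 2 * x\n")

   # Put the parent folder on the Python path, as the working directory would be.
   sys.path.insert(0, str(root))
   importlib.invalidate_caches()

   import demo_package
   import demo_package.test

   print(demo_package.greeting)         # defined in __init__.py
   print(demo_package.test.double(21))  # defined in the submodule test.py

The definitions from ``__init__.py`` are available under the package name, while those from ``test.py`` are reached through the dot syntax.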
Extending the documentation
+++++++++++++++++++++++++++
One of the most important parts of software is its documentation.
For modular packages like ``mdevaluate`` it's crucial to have a good coverage of the API,
since users need to know which functions are provided and how they are used.
Extending the documentation is therefore a good way of contributing to mdevaluate and helping other users.
The documentation is generated with a third-party tool named `Sphinx <http://www.sphinx-doc.org/en/stable/>`_.
The contents of the documentation are based on the source code (for the reference guide)
and documents written in the markup language *reStructuredText* (rst).
The source of every page can be viewed in the browser through the *View page source* link in the upper right of the page.
The name of the rst files can also be derived from the page URL.
The rst files are placed in the ``doc`` directory of the repository.
Extending the documentation can be done in different ways, e.g.
- Correct, clarify or extend existing sections
- Add new sections about the general use of mdevaluate
- Add use cases to the special topics section.
To add a new section to special topics, first create a new file for this guide in ``doc/special``.
Then add the name of this file (without the .rst extension) to the toctree in the file ``special-topics.rst``.
Now write the guide in the newly created file.
Building the docs
-----------------
When you have made changes to the docs, first re-build them locally.
You will need to have the ``sphinx`` python package installed and of course a working environment for ``mdevaluate``.
When those requirements are fulfilled, build the docs as follows:
1. Navigate to the ``doc`` directory
2. Run ``make html`` in the shell
3. View the produced html files in the browser: ``firefox _build/html/index.html``
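Before running ``make html``, it can help to check that the required packages are importable in the active environment.
The following is a small sketch using only the standard library; the helper name ``missing_requirements`` is hypothetical and not part of ``mdevaluate`` or Sphinx.

.. code-block:: python

   import importlib.util


   def missing_requirements(names):
       """Return the names from ``names`` that cannot be found for import."""
       return [name for name in names if importlib.util.find_spec(name) is None]


   # The two requirements named in this section.
   missing = missing_requirements(["sphinx", "mdevaluate"])
   if missing:
       print("Install before building the docs:", ", ".join(missing))
   else:
       print("All requirements found; run 'make html' in the doc directory.")

The check only tells whether the packages can be located, not whether their versions are suitable.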
Organization of the code
++++++++++++++++++++++++
The code for the evaluation software is organized in two python packages:
- ``pygmx``: This package provides a python wrapper for the Gromacs library and
thereby functionality to read file formats used within Gromacs.
- ``mdevaluate``: This package provides functionality for evaluation of molecular
dynamics simulations. It uses the ``pygmx`` package to read files, but is
(in theory) not limited to Gromacs data.
Submodules
----------
Below the content of the submodules of the package is described.
atoms.py
........
Definition of the ``Atom`` class and related functions for atom selection and information.
autosave.py
...........
Experimental functionality for automatic saving and loading of evaluated data,
like correlation functions. For each function call a checksum is calculated
from the input, which changes if the output of the function changes.
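The checksum idea can be illustrated with a short sketch.
This is not the implementation used in ``autosave.py``: the names ``checksum`` and ``cached_call`` are hypothetical, the cache is a plain dictionary instead of files on disk, and only the call arguments enter the checksum.

.. code-block:: python

   import hashlib
   import pickle

   _cache = {}  # in-memory stand-in for results saved on disk


   def checksum(*args, **kwargs):
       """Return a hex digest that identifies one set of call arguments."""
       payload = pickle.dumps((args, sorted(kwargs.items())))
       return hashlib.sha256(payload).hexdigest()


   def cached_call(function, *args, **kwargs):
       """Evaluate ``function`` only if this input has not been seen before."""
       key = (function.__name__, checksum(*args, **kwargs))
       if key not in _cache:
           _cache[key] = function(*args, **kwargs)
       return _cache[key]


   calls = []  # records every real evaluation


   def slow_square(x):
       calls.append(x)
       return x * x


   first = cached_call(slow_square, 4)   # evaluated
   second = cached_call(slow_square, 4)  # same checksum, taken from the cache
   third = cached_call(slow_square, 5)   # different input, evaluated again
   print(first, second, third, calls)

A real implementation would also need to account for anything else that influences the result, for example the code of the function itself.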
coordinates.py
..............
Definition of the ``Coordinates`` class and ``CoordinatesMap`` for coordinate
transformations and related functions.
correlation.py
..............
Functionality to calculate correlation functions.
distribution.py
...............
Functionality to calculate distribution functions.
reader.py
.........
Defines reader classes that handle trajectory reading and caching.
utils.py
........
A collection of utility functions.
Set up a development environment
++++++++++++++++++++++++++++++++
.. code-block:: console

   $ git clone https://github.com/mdevaluate/mdevaluate.git

Organization of the repository
------------------------------
The repository is organized through git branches.
At the moment there exist two branches in the remote repository: *master* and *dev*.
Adding code to the repository
+++++++++++++++++++++++++++++
All changes to the code are done in your local clone of the repository.
If a feature is complete, or at least works, the code can be pushed to the remote,
to make it accessible for others.
A standard workflow to submit new code is the following:
1. Fork the main repository on GitHub and clone your fork to your local machine.
2. Create a new branch locally and apply the desired changes.
3. If the master branch was updated, merge it into the local branch.
4. Push the changes to GitHub and create a pull request from your fork.
Pulling updates from remote
---------------------------
Before working with the code, the latest updates should be pulled for the master branch
.. code-block:: console

   $ git checkout master
   $ git pull

Create a new branch
-------------------
Before changing any code, create a new branch in your local repository.
This helps to keep an overview of all the changes and simplifies merging.
To create a new branch locally enter the following commands
.. code-block:: console

   $ git checkout master
   $ git branch my-feature
   $ git checkout my-feature

First switch to the master branch to make sure the new branch is based on it.
Then create the new branch, called ``my-feature``, and switch to it.
Now you can start making changes in the code.
Committing changes
------------------
A bundle of changes in the code is called a *commit*.
These changes can happen in different files and should be associated with each other.
Let's assume, two files have been changed (``atoms.py`` and ``utils.py``).
The command
.. code-block:: console

   $ git diff atoms.py

will show you all changes that were made in the file since the latest commit.
Before committing, changes have to be *staged*, which is done by
.. code-block:: console

   $ git add atoms.py utils.py

This may be repeated as often as necessary.
When all changes for a commit are staged, it can actually be created
.. code-block:: console

   $ git commit

This will open up an editor where a commit message has to be entered.
After writing the commit message, save & close the file, which will create the commit.
Create a pull request
---------------------
When all changes are made and the new feature should be made public, you can open a new pull request on GitHub.
Most of the time, the master branch will have been updated, so first pull any updates
.. code-block:: console

   $ git checkout master
   $ git pull

When the master branch is up to date, it can be merged into the feature branch
.. code-block:: console

   $ git checkout my-feature
   $ git merge master

If no conflicting changes were made, merging works automatically.
If, for example, the same line was modified both in a commit in master and in one of your commits, a merge conflict will occur.
Git tells you which files have conflicts and asks you to resolve these.
The respective lines will be marked with conflict-resolution markers in the files.
The most basic way of resolving a conflict is by editing these files and choosing the appropriate version of the code.
See the `git documentation <https://git-scm.com/book/en/v2/Git-Branching-Basic-Branching-and-Merging#Basic-Merge-Conflicts>`_ for an explanation.
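To give an impression of the conflict markers, the following sketch scans a piece of text for them.
The helper ``conflict_marker_lines`` and the example function ``shift`` are hypothetical; the marker layout assumes the merge of ``master`` into the checked-out feature branch described above, where ``HEAD`` refers to your own branch.

.. code-block:: python

   def conflict_marker_lines(text):
       """Return 1-based numbers of lines that start with a conflict marker."""
       markers = ("<<<<<<< ", "=======", ">>>>>>> ")
       return [
           number
           for number, line in enumerate(text.splitlines(), start=1)
           if line.startswith(markers)
       ]


   # A file as git leaves it after a conflict: your version comes first,
   # the version from master second.
   example = """\
   def shift(x):
   <<<<<<< HEAD
       return x + 1
   =======
       return x + 2
   >>>>>>> master
   """

   print(conflict_marker_lines(example))

Resolving the conflict means keeping the appropriate version of the code and deleting the three marker lines.
Note that this simple check would also flag other lines that start with seven equals signs, such as heading underlines in rst files.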
After resolving the conflict, the files need to be staged and the merge has to be committed
.. code-block:: console

   $ git add utils.py
   $ git commit

The commit message will be generated automatically, indicating the merge.
After merging, the changes can be pushed to the remote
.. code-block:: console

   $ git push

The new code is now available in the remote.