Simple and Minimal Git Server Setup
Published 2020-12-31 on Yaroslav's weblog
The New Year is just around the corner, but there's still a little time to write one last post before the year ends. This time I want to write about my experience setting up a git server. I had already set up before a git server using a free and open source kind of clone of Github called Gitea, but there's a better way to setup a git server, especially if you just want to setup a personal Git server.
About a year ago I setup a git server using Gitea1 server together with a CI/CD system called Drone2. Back then it made sense, because I was working on some projects with a couple of friends, and I needed easy access for them to be able to work together with me. However, time went on, we stopped working on those projects, and I left that git server running for my own needs, i.e. to store the source of my website and deploy it automatically on each push to master.
It worked fine all around, but it was too bloated for my tastes, and I didn't really need most of the features that both Gitea and Drone had to offer. The containerized way of working using Docker containers for the servers themselves, and for the post-push hooks used, in my opinion, too many resources. And it turns out I was right.
I wanted to get rid of all of that, one, because I didn't really needed all of that; two, because I like minimalism, and try to avoid web interfaces and their clunky ways of working as much as possible; and three, because I wanted to get rid of the VPS in which they were hosted, and wanted to have it all on one VPS, the one where my website is being hosted, among many other things, such as my email server.
I already knew that there were some light-weight web front-ends for git, such as cgit3, or even git's own web interface; I didn't know what exactly they had though, so I started digging deeper into git itself, by reading the documentation, and looking in the internet for tutorials and explanations of how cgit and similar tools worked.
After learning more about git, and finding myself with more time than usual all thanks to COVID-194 and the government, I finally decided to just trying out setting up my minimal Git server.
Prelude, or some things to know before setting up a git server
Before trying to setup a git server this way, you should know some basic stuff about how git works. I didn't know a lot of these details when I embarked myself upon this venture, so I ended having to read a good deal of information on the way git works, which one of the reasons I really enjoyed this. Turns out git is a much better piece of software than I thought before, and I already had a really high opinion of it.
First, all of the git hosting solutions out there, Github, Gitlab, Gitea etc. are using none other that git, and some other like gitolite, as their main backend. Every time you push something to, say Github, be it by ssh or http, git gets called on their server. Github just shows you what git is telling them.
Second, you don't really need a web interface to have a git server. The web interface is just a nice way of showing your repositories to other people. A computer with ssh access is more than enough, if all you want to do is host your repositories.
Third, there are, in practice, two types of repositories:
- Standard repositories that contain all the information and actual files in the
.git
directory, together with a "working directory" at the root, which is the representation of the state of your projects in the ref you are currently, and where you actually, well, work on your files by editing them. This are not the actual files stored by git. - Bare repositories, which forgo the "working directory" and just contain all the git files and information at the root of the directory. This types of repositories are the ones meant for the server side of things.
Setting up the git backend
Now that you know a little bit more about how git works (or maybe you already knew that), it's time to set up the actual git server. All you need is a computer with access to the internet, preferably with a static IP (like a VPS), and optionally a domain name, just to make things prettier and easier. To guide myself through this process, I mostly followed the Arch Linux wiki guide on it5, and the official Git book chapter on Git on the server6.
First thing that you need to do, is setup ssh access. This is pretty easy to do, you most probably already have it setup, and it is out the scope of this tutorial, so I am not going to explain how to do it in this post.
Next, you need to make a user specially for git. This is quite important, since you don't want people with access (even just read access) to your repositories to have access to other parts of your server. For simplicity, we'll just call it git.
Each Linux distro (or Unix-like OS), has its own defaults, and sometimes even alternate ways to create users. For simplicity's sake I decided to use Debian's defaults when creating the git user:
# useradd git
# passwd git
These two commands, at least in Debian Buster, should create a normal user with
a home directory at /home/git/
and prompt you for a password for the git user.
Some people like to have the git home directory in a location such as
/srv/git/
, but I just decided to go with the defaults for simplicity's sake.
Now we need to setup some things for the git user. First we should switch to it:
$ su -l git
And once we are the git user, we should make a .ssh
folder with a
authorized_keys
file, which should contain your public key(s), and also the
keys of any people you plan on having write (push) access to your repositories.
Keep in mind, that any people with push access, will have push access to all
of the repositories under the git user.
Now we should think about where we are going to store the git repositories. You
could just store them at the root of the git home directory. However, I decided
to store them in a separate folder, since I also wanted to keep some (mainly
uninteresting) repositories private. So I created two folders private
, and
public
. Public, as its name implies, is going to be the directory with the
repositories that are going to be shown on the web interface for all the people
to see. I could have also created a separate web interface protected by password
for the private
repositories, but the web interface is mainly just for people
on the internet to take a glance at my repositories.
Now for a test, you can create your first repository under the public folder,
and try pushing into it. So we should cd into our public
directory (or
whatever directory you plan to store your repositories at) and init a bare
repository. By convention bare repository names end with a .git
, so it should
look something like this:
$ cd ~/public
$ git init --bare my-repo.git
Any time that you want to add a repo to your server, you'll basically need to login to your git user and repeat the steps above. Or, if you already have a non-empty server, you could clone it this way:
git clone --bare <path-or-url-to-my-repo> my-repo.git
Now you can push from your local computer to that directory using ssh this way:
$ git remote set-url origin git@your-domain-or-ip:public/my-repo.git
$ git push -u origin master
Or clone it, once you have something there:
$ git clone git@your-domain-or-ip:public/my-repo.git
Now you have working git server. It's that simple! However, right now there's now read-only access for other people to clone your repos. There's two ways you could solve that. You could use git's own smart HTTP, or the git daemon. In my case I didn't end up using git's smart HTTP, since I used cgit's functionality for that, but if you are interested in any of them, you can check out the official git book chapter that I mentioned earlier6.
Before we go further, though, we need to harden our git user a little bit, especially if you intend on giving other people push access. Right now, anybody with push access, basically has ssh access to your git user, since basically when you are pushing using this method, the only thing that is happening is that git is ssh'ing into your server and executing git (the one on your server) in your server. That's a big no-no. For that, we are going to change the standard login shell of git, to one that will allow git to function properly, but won't allow people to login to an full-fledged interactive shell into your git user using ssh.
Fortunately, git provides use with a limited shell for this specific case, called the git-shell. First, we need to know where it is located:
$ which git-shell
/usr/bin/git-shell
Now that we know its actual location, we need to change the default shell for our git user:
# chsh -s /usr/bin/git-shell git
Now you might think, well, how do I log into the git user to add/edit
repositories? Just log into your normal user, and then use su
to login with a
shell like sh, bash, or zsh:
$ su -s /bin/bash git
Setting up the web interface
Git provides its own web interface, which, if you want the minimalest of the minimalest, should be fine. I, however, decided to use cgit3, since I wanted a little more flexibility and more room for configuration. It's even the preferred web interface for the kernel.org git repositories.
To use it, we'll need not only it, but also a reverse proxy for it to work with it. My preferred program for that is nginx, I have used it for several years, and it love it for its ease of use and its performance. You could also use other software, like Apache, if that's what you prefer.
Even though I'm using Debian on my VPS, I mostly followed the Arch Linux Wiki article on cgit7. If you're not yet familiar with the Arch Wiki, you should definitely check it out. It's arguably the best resource on Linux out there.
So basically we need the following packages:
# apt install nginx fcgiwrap cgit
After that, we should go on and configure nginx. I won't go into big detail on
how to configure nginx itself, but the configuration file for cgit should be
located here /etc/nginx/sites-available/cgit
and look something like this:
server {
listen 80;
listen [::]:80;
server_name <your-git-domain>;
return 301 https://$host$request_uri;
}
server {
listen 443 ssl;
server_name <your-git-domain>;
ssl_certificate <your-ssl-cert-path>;
ssl_certificate_key <your-ssl-cert-key-path>;
root /usr/share/cgit;
try_files $uri @cgit;
location @cgit {
include fastcgi_params;
fastcgi_param SCRIPT_FILENAME /usr/lib/cgit/cgit.cgi;
fastcgi_param PATH_INFO $uri;
fastcgi_param QUERY_STRING $args;
fastcgi_param HTTP_HOST $server_name;
fastcgi_pass unix:/run/fcgiwrap.socket;
}
}
This configuration assumes that you have setup an SSL certificate. You can configure it just use plain HTTP, but I really recommend you get an SSL certificate.
Now that we have nginx configured we should start and enable both nginx and fcgiwrap.
At this point, you should be able to see cgit by going to the domain or url that you pointed at in the nginx config. However, it probably won't be showing you your repositories just yet, and that's because we need configure cgit, in part so that we can tell it where to look for our repos.
I won't go into much detail on how to configure cgit, but I'll show you my
configuration file, so you have an idea of how to configure yours. This file
should be located at /etc/cgitrc
:
#
# cgit config
# see cgitrc(5) for details
css=/cgit.css
logo=/cgit.png
virtual-root=/
max-repodesc-length=100
repository-sort=age
root-title=Yaroslav's git repositories
root-desc=Public repos for some of my projects, and mirrors of other projects.
# Possible readme files for about page
readme=:README.md
readme=:README.txt
readme=:README.rst
readme=:README
# Syntax highlighting
source-filter=/usr/local/lib/cgit/filters/syntax-highlighting.py
# About page rendering script
about-filter=/usr/local/lib/cgit/filters/about-formatting.sh
# Some common mimetypes for serving files raw
mimetype.gif=image/gif
mimetype.html=text/html
mimetype.jpg=image/jpeg
mimetype.jpeg=image/jpeg
mimetype.pdf=application/pdf
mimetype.png=image/png
mimetype.svg=image/svg+xml
# Set of snapshot formats for people to download
snapshots=tar.gz zip
# Git configuration
enable-http-clone=1
enable-commit-graph=1
remove-suffix=1
section-from-path=1
scan-path=/home/git/public
clone-prefix=https://git.yaroslavps.com git://git.yaroslavps.com
The most important bits here are:
virtual-root=/
enable-http-clone=1
scan-path=/home/git/public
clone-prefix=https://git.yaroslavps.com
You can read more about cgit's configuration in the Arch Wiki article I mentioned earlier8. I'll just say, that you should keep in mind that the order of the configuration variables is important. If something is not working as it should, its probably because you put some variable before another one, and their order should be changed. This gave a big headache when I was trying to configure cgit for myself.
One last thing worth mentioning, regarding cgit, is that you can configure each
repo for cgit by adding a cgitrc
file inside the bare repository in your
server. You can then add some more information about it like this:
owner=Yaroslav de la Peña Smirnov <yps@yaroslavps.com>
desc=Personal site and blog.
For more information, read cgitrc's manual:
$ man 5 cgitrc
Conclusion
This is one possible way of having your own hosting solution for git repositories, in a very minimal way. Since the cgit frontend is not running constantly, it's written in C, and is very minimal, it barely consumes any resources. This is also a great way to be more independent of centralized, proprietary services like GitHub. Of course, services like those are always great for visibility, but you could always just have mirror of your open source projects on them that points to your personal server.
There many other great things that you can do with a setup like this, like setting up git hooks, without having to set up a complex and complicated CI/CD system, for simple tasks, like in my case, pushing, generating and deploying my site on each push to master. For an example of a post-receive you can checkout mine here: https://git.yaroslavps.com/yaroslavps.com/tree/post-receive
For a little more information on git, I recommend you to read the following manual pages:
$ man git
$ man git-init
$ man git-clone
$ man git-config
$ man githooks
$ man git-send-email
$ man git-format-patch
The last are especially useful if you want to know how to collaborate with other people on other projects, or want other people to collaborate on yours, without having to resort to the centralized way of doing things of GitHub and similar sites. Git is especially well suited for collaborating through email, which is in and of itself a decentralized service. There's a guide on collaborating through email using git8, written by the great Drew DeVault, which is, in my opinion, one of the best FOSS programmers out there. I recommend also to check out his blog9, which I must say is much more interesting than mine.
A little unrelated note
As I write this post, this strange year is (finally) coming to an end. It wasn't a completely terrible year per se, but it did present us with a lot of nasty surprises. I have quite a lot of personal plans for this coming 2021, some of which involve this site. Some of them I'll keep a secret for now, but most importantly I want to diversify my writing a little bit more. I kind of already did that this year, with my "recipe book"10, but I also want to start writing about some other things on my weblog. So far it's been mostly about software development and computer-related stuff, but there are many other things not related to that on my mind, which I'd like to write about.
Here's to a New Year, that's not as Orwellian as this one, and may you accomplish as much as you set yourself to accomplish.
Happy New Year and Merry Christmas11 to all!
I'll write a little bit about my experience with the corona, and my thoughts on it next year.
I know, a little bit late for Catholic or Protestant folks out there. But just in time for (Russian) Orthodox Christmas ;)
https://www.yaroslavps.com/weblog/minimal-git-server/
© 2018—2024 Yaroslav de la Peña Smirnov.