So I thought logging in Python would be easy. You start logging to the command line then when that gets too much you log to a file. A bit of me still feels logging to a file in the app directory is the easiest set up. But various articles on the Internet say file logging … Continue reading Logging a Python App to journald
And other Linux delights. Often there is a need to configure processes to start on startup and to restart on failure. On Linux systems this process is controlled via the systemd init system and system manager. Now, as with most things Linux, there are a lot of sources of information out there. Several of them … Continue reading Running Scripts at Startup
Twitter rambles about explanations and AI. Maybe to spruce up at a later date.
I've loved spaCy for a long time but I've only just got my head around how you can structure a text processing pipeline to take full advantage of its power.
This is a short post explaining how to obtain over 50,000 text books for your natural language processing projects. The source of these books is the excellent Project Gutenberg. Project Gutenberg offers the ability to use sync the collection of books. To obtain the collection you can set up a private mirror as explained here. … Continue reading Getting All the Books
Or understanding how to best use Docker. Docker is a great way to build services with modular and changeable components without borking your server / computer. I like to think of Docker containers as a system version of Python's virtual environment - you can build a stack of services and applications through a Docker file, … Continue reading Taming the Docker Blob
This is a quick post intended to help those trying to understand convolution as applied in Tensorflow. There are many good blog posts on the Internet explaining convolution as applied in convolutional neural networks (CNNs), e.g. see this one by Denny Britz. However, understanding the theory in one thing, knowing how to implement it is … Continue reading Understanding Convolution in Tensorflow
Recently I've been playing around with the last 15 years of patent publications as a 'big data' source. This includes over 4 million individual documents. Here I thought I'd highlight some problems I faced. I found that a lot of academic papers tend to ignore or otherwise bypass this stuff.
This is just a quick post to remind me of the steps to resolve errors on an Apache-served Flask app. I'm using Anaconda as I'm on Puppy Linux (old PC) and some compilations give me errors. Stuff in square brackets is for you to fill in. Log into remote server (I use ssh keys): ssh -p … Continue reading Fixing Errors on Apache-Served Flask Apps
It just so happens that on a few systems I have been using Anaconda to allow painless Python coding. For example, on Windows or non-Debian Linux I have struggled to compile packages from source. Anaconda provides a useful wrapper for the main functionality that just works on these operating systems (on my Ubuntu machine or the … Continue reading Starting a Python Project with Anaconda