Twitter rambles about explanations and AI. Maybe to spruce up at a later date.
I've loved spaCy for a long time but I've only just got my head around how you can structure a text processing pipeline to take full advantage of its power.
This is a short post explaining how to obtain over 50,000 text books for your natural language processing projects. The source of these books is the excellent Project Gutenberg. Project Gutenberg offers the ability to use sync the collection of books. To obtain the collection you can set up a private mirror as explained here. … Continue reading Getting All the Books
Or understanding how to best use Docker. Docker is a great way to build services with modular and changeable components without borking your server / computer. I like to think of Docker containers as a system version of Python's virtual environment - you can build a stack of services and applications through a Docker file, … Continue reading Taming the Docker Blob
This is a quick post intended to help those trying to understand convolution as applied in Tensorflow. There are many good blog posts on the Internet explaining convolution as applied in convolutional neural networks (CNNs), e.g. see this one by Denny Britz. However, understanding the theory in one thing, knowing how to implement it is … Continue reading Understanding Convolution in Tensorflow
Recently I've been playing around with the last 15 years of patent publications as a 'big data' source. This includes over 4 million individual documents. Here I thought I'd highlight some problems I faced. I found that a lot of academic papers tend to ignore or otherwise bypass this stuff.
This is just a quick post to remind me of the steps to resolve errors on an Apache-served Flask app. I'm using Anaconda as I'm on Puppy Linux (old PC) and some compilations give me errors. Stuff in square brackets is for you to fill in. Log into remote server (I use ssh keys): ssh -p … Continue reading Fixing Errors on Apache-Served Flask Apps
It just so happens that on a few systems I have been using Anaconda to allow painless Python coding. For example, on Windows or non-Debian Linux I have struggled to compile packages from source. Anaconda provides a useful wrapper for the main functionality that just works on these operating systems (on my Ubuntu machine or the … Continue reading Starting a Python Project with Anaconda
One thing I've found hard about programming in Python is the jump from small scripts or iPython (now Jupyter) notebooks to fully functional programs. Many examples and online tutorials only require a single ".py" file or a series of command line or notebook entries. However, as you get more advanced and start looking at complete Flash applications or … Continue reading Quick Post: Structuring a Python Program
Or how to get very quickly write-restricted by Twitter. This is a short guide to playing around with the Twitter API using Python on a Raspberry Pi (or any other Linux machine). Overview The process has four general steps: - Setup a new Twitter account and create a new Twitter app; Setup the Raspberry Pi … Continue reading Twitter Robots on a Raspberry Pi