A Short Reference of Python Logging

2016-05-26 by terryoy, in tricks

I have for many times use the logging function, but never understand it completely. So I go through the document and make some notes, hoping it will help me use it more quickly in the future.

1. Basic Config

If you want to use a programmable method other than a configuration file, the basicConfig() method is the general initializing method.

The most basic form is default log, which you don't need basicConfig(). It is using console output with WARNING level.

>>> import logging
>>> logging.debug('hello') # no output
>>> logging.warn('world')
world

The basic config contains a list of below elements:

filename - Using a FileHandler to output the log
filemode - file open mode('r', 'w', 'a'), mainly used to choose append or write a new log file
format - A string for specifying the log output template, If you want to lookup a list of supported keywords, look for section 'LogRecord attributes' in the python official document
datefmt - A specified date/time format.
level - set the root loglevel for the logger
stream - Specify a stream for the StreamHandler, for example, a buffer output stream or stdout. It will be ignored if “filename” is present.

The logger can be initialized only once when basicConfig() is called. Then

>>> import logging
>>> logging.basicConfig(filename='program.log', 
        filemode='a',
        format="%(asctime)-15s %(levelname)s [%(module)s] %(message)s",
        datefmt="%Y-%m-%d %H:%M:%S.%f",
        level=logging.DEBUG)

2. Configuration Object and the Modular Approach

When choosing the Modular Approach of logging, you need to deal with 4 elements:

loggers - the interface that application modules used to log things
handlers - send the log records (that loggers created) to the appropriate destinations
filters - provide a finer grained facility for determining which log records should be output
formatters - specify the layout of the log records in the final output

2.1 Logger hierarchy

The loggers used by all the modules are formed in a conceptual hierarchy by the naming with a separator('.'). For example: 'abc.text', is the descendant of logger 'abc', while 'abc' can be the parent of 'abc.text', 'abc.pdf', 'abc.image', etc. A good convention is to use loggers in a module sense, using in each .py as below:

logger = logging.getLogger(__name__)

The root of all loggers is called the “root” logger, which prints the logger name as “ROOT” in output.

2.2 Useful handlers

There are some useful handlers in the section of python Logging Howto document. Some of them are listed as here:

StreamHandler - to stream object (default stdin?)
FileHandler - to a disk file
RotatingFileHandler - from BaseRotatingHandler, send logs to files, rotating log file with a maximum file size.
TimedRotatingFileHandler - from BaseRotatingHandler, send logs to files, rotating log file at a certain timed intervals.
SocketHandler/DatagramHandler - send log messages to TCP/IP and UDP sockets
SMTPHandler - Send to a designated email address
NullHandler - Do nothing, it's used in development that supports logging with this mock

This shows the variety of logging output scenarios, which you could look them up in the python doc.

2.3 The propagation of loggers

Look at the flow of logging in the below diagram from python's tutorial,

logging flow

When a log record is send to the logger in the module, it will first check if its own filter(the filter of a logger) reject it, then pass to its handler; if propagation is set to true(by default), it will pass the log record to it's parent too, so the log record will bubble up till the root logger, and each logger will judge by their handler and filter to decide whether to output the log record. So we often setup a top level logger, and then configure a child logger only if needed.

2.4 Configuring Logging

The most usual approaches are using fileConfig() and dictConfig(). With fileConfig() you can use a .conf file to load the settings (this approach is deprecated), and with dictConfig() you can use even wider range of persistence choices, such as JSON, python file, yaml, etc.

For example, I have written a small utils for command line interaction and also want to log the HTTP request details. So I defined two handlers: one for console output, another for file output so that I can review the details. The console output must be simple without unneccessary information, and the file output should contains all the time, module details for investigation. Here is my configuration using a python file. (The advantages for a python configuration is that you can also use expressions and comments.)

import logging, logging.config

config = {
    "log_config": {
        "version": 1,
        "formatters": {
            "brief": {
                "format": "%(message)s",
            },
            "detail": {
                "format": "%(asctime)-15s %(levelname)s [%(name)s.%(funcName)s] %(message)s",
                "datefmt": '%Y-%m-%d %H:%M:%S',
            },
        },
        "handlers": {
            "console": {
                "class": "logging.StreamHandler",
                "level": "INFO",
                "formatter": "brief",
            },
            "file": {
                "class": "logging.handlers.RotatingFileHandler",
                "filename": "dev.log",
                "level": "DEBUG",
                "formatter": "detail",
            },
        },
        "root": {
            "handlers": ["console", "file"],
            "level": "DEBUG",
        },
        "loggers": {
            "requests": {
                "handlers": ["file"],
                "level": "DEBUG",
                "propagate": False,
            }
        },
    },
}

logging.config.dictConfig(config["log_config"])

There are two formatters: “brief” for simply output the message body, the root loggers is default logger for all the modules I write, and also the 3rd party libraries like python requests. Since all the info log should appear in both console and the file, I need to put both inthe root logger. However, to avoid the unneccssary debug log showing in console, I set the level INFO in the console handler. This enables the file logger logs everything while the console doesn't.

Next I discover that the library “requests” also have some “INFO” log which is unneccessary in console, so I will specificially make it disappear using the loggers config. The important thing here is to use the propagate feature.

The “file” logger wants the requests' debug log, so I need to set the level to DEBUG. By default, it will propagate the log record to the “root” logger which make it appear to console. So I will use propagate: False to disable the propagation. Then the log records will stay in the “requests” logger and will be invisible to the “root” logger.

If you're not sure what to config with, write a small example project to experiment the result.

Tags: python