1. Module overview

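linecache caches the contents of files. Once a file has been read, later reads of the same file do not open it again; the lines are served directly from the cache.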
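A minimal sketch of this behaviour, written for Python 3 and using a throwaway temporary file (the names `cached_path`, `first_read`, and `second_read` are illustrative and not from the original post). After the first read, the line is still returned even though the file has been deleted, because the second call never touches the disk:

```python
import linecache
import os
import tempfile

# Create a temporary file so the example is self-contained.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as fp:
    fp.write("cached line\n")
    cached_path = fp.name

first_read = linecache.getline(cached_path, 1)   # reads the file and fills the cache
os.remove(cached_path)                            # the file is gone from disk
second_read = linecache.getline(cached_path, 1)  # answered from the cache

print(repr(first_read), repr(second_read))

linecache.clearcache()
```

The flip side is that the cache can go stale when a file changes, which is what checkcache is for.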

2. Module usage

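The module's basic functions are getline, clearcache, getlines, and checkcache:

getline returns the content of a given line. Line numbers start at 1, and an empty string is returned if the line does not exist.

clearcache empties the cache.

getlines returns all lines of a file from the cache. If the file is not in the cache yet, the cache is updated first; if the update fails (for example, because the file is too large to fit in memory), an empty list is returned.

checkcache discards cache entries that are out of date, that is, entries whose file has changed on disk or can no longer be found. It is not a time-based expiry.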
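The read functions described above can be tried out with the following sketch. It targets Python 3, and the temporary file and variable names (`demo_path`, `first`, `missing`, `all_lines`) are assumptions made for illustration:

```python
import linecache
import os
import tempfile

# Create a small throwaway file; the last line deliberately has no newline.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as fp:
    fp.write("first\nsecond\nthird")
    demo_path = fp.name

first = linecache.getline(demo_path, 1)     # line numbers start at 1
missing = linecache.getline(demo_path, 99)  # out of range: '' and no exception
all_lines = linecache.getlines(demo_path)   # served from the cache filled above

print(repr(first))
print(repr(missing))
print(all_lines)

linecache.clearcache()  # drop every cached file
os.remove(demo_path)
```

Note that linecache appends a newline to the last line if the file does not end with one, so every returned line ends in '\n'.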

Example:

# Python 2.7 (print statements, byte strings)
import linecache
import os

# Name of the small file
smallFileName = "BrowseQueryResult.txt"
# Name of the large file
bigFileName = "PaperID_mapping_to_AffiliationsID.txt"

# Read the first line of the small file
smallLine1 = linecache.getline(smallFileName, 1)
print "small file line 1:" + smallLine1.decode("gb2312")

# Read every line of the small file
cacheSmall = linecache.getlines(smallFileName)
print "small file length = %d" % len(cacheSmall)
print "small file size = %d KB" % (os.path.getsize(smallFileName) / 1024.0)

# Read every line of the large file
cacheBig = linecache.getlines(bigFileName)
print "big file length = %d" % len(cacheBig)
print "big file size = %d MB" % (os.path.getsize(bigFileName) / (1024.0 * 1024))

linecache.clearcache()

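The console output is shown below. When the machine is short on memory, linecache.getlines fails on the large file and returns an empty list: reading the whole file at once raises MemoryError, which getlines catches. os.path.getsize shows how large each file is.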

small file line 1:教育技术学视野下的未来课堂研究        1769    教育|199||教育技术|53||教育技术学|22||未来课堂|13||技术|12||教育技术学视野下的未来课堂研究|7||课堂教学模式|5||云计算|5||教学模式|5||课堂|5||发展性教学|4||信息化教学模式|4||教育技术 并含 技能|4||颠倒课堂|4||课堂教学|4||在线教育|4||未来教室|4||课堂互动|4||计算机|3||思维导图|3||智慧教室|3||信息化教学|3||教育技术 技能|3||教育技术学视野下的未来课堂|3||毕业论文|3||电子书包|3||合作学习|2||互联网|2||评价|2||提高远程教学交互实效的教学教法研究|2||教育技术研究方法|2||沉积物磷|2||数学 自主探究|2||教育技术技能|2||学习空间设计|2||教育技术发展|2||电子商务|2||数字化校园|2||信息技术支持下的教育教学模式研究|2||情报 技术|2||末来课堂|2||模糊数学|2||绿色建筑|2||数学|2||心理学|2||物流|2||泛在学习|2||财务管理|2||未来|2||信息技术|2
small file length = 100
small file size = 103 KB
big file length = 0
big file size = 3219 MB
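Since getlines loads the entire file into a list, one way around the empty result for very large files is to skip linecache and read lazily. The sketch below is an alternative technique, not something linecache provides: `read_line_streaming` is a hypothetical helper written for Python 3, and the UTF-8 default encoding is an assumption. It caches nothing, so every call rescans the file from the start.

```python
import itertools
import os
import tempfile


def read_line_streaming(path, lineno, encoding="utf-8"):
    """Return line `lineno` (1-based) of `path`, or '' if there is no such line.

    Hypothetical helper, not part of linecache: it iterates over the file
    lazily instead of building a list of all lines.
    """
    if lineno < 1:
        return ""
    with open(path, "r", encoding=encoding) as fp:
        # islice skips the first lineno-1 lines and yields at most one line.
        for line in itertools.islice(fp, lineno - 1, lineno):
            return line
    return ""


# Small self-contained demonstration on a temporary file.
with tempfile.NamedTemporaryFile(
    "w", suffix=".txt", delete=False, encoding="utf-8"
) as fp:
    fp.write("alpha\nbeta\ngamma\n")
    stream_path = fp.name

second = read_line_streaming(stream_path, 2)
beyond = read_line_streaming(stream_path, 10)
below = read_line_streaming(stream_path, 0)
print(repr(second), repr(beyond), repr(below))

os.remove(stream_path)
```

The trade-off is time for memory: looking up a line near the end of a multi-gigabyte file still means reading everything before it.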

3. Source code analysis

The linecache source is located at Python-2.7.10\Lib\linecache.py.

The annotated source is as follows:

"""Cache lines from files.

This is intended to read lines from modules imported -- hence if a filename
is not found, it will look down the module search path for a file by
that name.

"""

import sys
import os

__all__ = ["getline", "clearcache", "checkcache"]

# Return the content of the given line
def getline(filename, lineno, module_globals=None):
    # Fetch all lines through getlines
    lines = getlines(filename, module_globals)
    # If the requested line number is within the file's line count, return that line
    if 1 <= lineno <= len(lines):
        return lines[lineno-1]
    else:
        return ''

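# Cache entries have the form cache[filename] = size, mtime, lines, fullname
# filename: the file name, used as the key
# size: the file size
# mtime: the file's modification time
# lines: all lines of the file
# fullname: the full path of the file
cache = {} # The cache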

# Empty the cache
def clearcache():
    """Clear the cache entirely."""
    # Refer to the module-level cache
    global cache
    # Rebind cache to an empty dict
    cache = {}

# Return all lines of a file from the cache. If the file is not cached,
# update the cache; if the update fails (e.g. the file is too large),
# return an empty list.
def getlines(filename, module_globals=None):
    """Get the lines for a file from the cache.
    Update the cache if it doesn't contain an entry for this file already."""
    # If the file is in the cache, return its cached lines
    if filename in cache:
        return cache[filename][2]
    # Otherwise, update the cache
    try:
        return updatecache(filename, module_globals)
    # If updating the cache raises MemoryError, clear the cache and return an empty list
    except MemoryError:
        clearcache()
        return []

# Discard cache entries that are out of date
def checkcache(filename=None):
    """Discard cache entries that are out of date.
    (This is not checked upon each call!)"""
    if filename is None:
        filenames = cache.keys()
    else:
        if filename in cache:
            filenames = [filename]
        else:
            return
    for filename in filenames:
        size, mtime, lines, fullname = cache[filename]
        if mtime is None:
            continue   # no-op for files loaded via a __loader__
        # Stat the file on disk to get its current size and modification time
        try:
            stat = os.stat(fullname)
        # If the stat call fails, remove the file's entry from the cache
        except os.error:
            del cache[filename]
            continue
        # If either the size or the modification time differs, remove the entry
        if size != stat.st_size or mtime != stat.st_mtime:
            del cache[filename]

# Update the cache
def updatecache(filename, module_globals=None):
    """Update a cache entry and return its list of lines.
    If something's wrong, print a message, discard the cache entry,
    and return an empty list."""
    # If the file is already in the cache, drop its entry
    if filename in cache:
        del cache[filename]
    # An empty name or a pseudo-filename such as '<stdin>' cannot be read:
    # return an empty list
    if not filename or (filename.startswith('<') and filename.endswith('>')):
        return []
    # Start with the given file name as the full name
    fullname = filename
    # Stat the file under that name
    try:
        stat = os.stat(fullname)
    # If that fails, treat the given name as a base name and search for it
    except OSError:
        basename = filename
        # Try for a __loader__, if available
        if module_globals and '__loader__' in module_globals:
            name = module_globals.get('__name__')
            loader = module_globals['__loader__']
            get_source = getattr(loader, 'get_source', None)
            if name and get_source:
                try:
                    data = get_source(name)
                except (ImportError, IOError):
                    pass
                else:
                    if data is None:
                        # No luck, the PEP302 loader cannot find the source
                        # for this module.
                        return []
                    cache[filename] = (
                        len(data), None,
                        [line+'\n' for line in data.splitlines()], fullname
                    )
                    return cache[filename][2]
        # Try looking through the module search path, which is only useful
        # when handling a relative filename.
        if os.path.isabs(filename):
            return []
        # Try each directory on sys.path
        for dirname in sys.path:
            # When using imputil, sys.path may contain things other than
            # strings; ignore them when it happens.
            try:
                fullname = os.path.join(dirname, basename)
            except (TypeError, AttributeError):
                # Not sufficiently string-like to do anything useful with.
                continue
            try:
                stat = os.stat(fullname)
                break
            except os.error:
                pass
        else:
            return []
    # Read the whole file with file.readlines()
    try:
        with open(fullname, 'rU') as fp:
            lines = fp.readlines()
    # If reading fails, return an empty list
    except IOError:
        return []
    if lines and not lines[-1].endswith('\n'):
        lines[-1] += '\n'
    size, mtime = stat.st_size, stat.st_mtime
    # Store (size, mtime, lines, fullname) under the key filename
    cache[filename] = size, mtime, lines, fullname
    return lines
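The interaction between the cache tuple and checkcache in the listing above can be observed directly. The following sketch is written for Python 3, where entries for files read from disk have the same four-field layout; the temporary file and variable names are assumptions made for illustration. It shows that a changed file keeps returning stale lines until checkcache is called:

```python
import linecache
import os
import tempfile

with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as fp:
    fp.write("old content\n")
    path = fp.name

before = linecache.getline(path, 1)

# The cache entry mirrors the tuple built at the end of updatecache.
size, mtime, lines, fullname = linecache.cache[path]

# Rewrite the file with longer content so that its size certainly changes.
with open(path, "w") as fp:
    fp.write("new and longer content\n")

stale = linecache.getline(path, 1)    # still the old line: no stat on a cache hit
linecache.checkcache(path)            # size/mtime mismatch, so the entry is dropped
dropped = path not in linecache.cache
fresh = linecache.getline(path, 1)    # re-read from disk

print(repr(before), repr(stale), dropped, repr(fresh))

linecache.clearcache()
os.remove(path)
```

Because getlines returns cached lines without checking the file, callers that expect files to change need to call checkcache themselves before reading.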
