[Python 多线程] Semaphore、BounedeSemaphore (十二)

Semaphore

信号量，信号量对象内部维护一个倒计数器，每一次acquire都会减1，当acquire方法发现计数为0就阻塞请求的线程，直到其它线程对信号量release后，计数大于0，恢复阻塞的线程。

方法：

Semaphore(value=1) 构造方法。value小于0，抛ValueError异常。默认为1。

acquire(blocking=True,timeout=None) 获取信号量，计数器减1，获取成功返回True。

release() 释放信号量，计数器加1。

计数器永远不会低于0，因为acquire的时候，发现是0，都会被阻塞。

举例：

图书馆有三本书，三本都被借走(acquire)之后，其他人想看，就得等别人还回来(阻塞)，有人还回来(release)一本后，就有一个人可以拿到这本书，其他人仍然得等归还。

#Semaphore 信号量，借还

import threading,logging,time

DATEFMT="%H:%M:%S"

FORMAT = "[%(asctime)s]\t [%(threadName)s,%(thread)d] %(message)s"

logging.basicConfig(level=logging.INFO,format=FORMAT,datefmt=DATEFMT)

def work(s:threading.Semaphore):

    logging.info('in sub thread')

    logging.info(s.acquire())

    logging.info('sub thread oevr')

s = threading.Semaphore(3)

logging.info(s.acquire())

logging.info(s.acquire())

logging.info(s.acquire())

threading.Thread(target=work,args=(s,)).start()

time.sleep(2)

logging.info(s.acquire(False))  #不阻塞

logging.info((s.acquire(timeout=3))) #3秒超时会阻塞

logging.info('release')

s.release()

运行结果：

[08:48:43]	 [MainThread,8840] True

[08:48:43]	 [MainThread,8840] True

[08:48:43]	 [MainThread,8840] True

[08:48:43]	 [Thread-1,6212] in sub thread

[08:48:45]	 [MainThread,8840] False

[08:48:48]	 [MainThread,8840] False

[08:48:48]	 [MainThread,8840] release

[08:48:48]	 [Thread-1,6212] True

[08:48:48]	 [Thread-1,6212] sub thread oevr

　　这个例子只起了一个线程，如果多起几个，当release还回来的数小于阻塞的线程数时，程序就会一直处于阻塞状态，直到全部relase。

应用举例：

因为资源有限，且开启一个连接成本高，所以，使用连接池。

一个简单的连接池(例子)：

连接池应该有容量(value总数)，也应该工厂方法可以获取连接，能够把不用的连接归还，供其他使用者使用。

#一个简单的连接池

import threading,logging,time

DATEFMT="%H:%M:%S"

FORMAT = "[%(asctime)s]\t [%(threadName)s,%(thread)d] %(message)s"

logging.basicConfig(level=logging.INFO,format=FORMAT,datefmt=DATEFMT)

class Conn:

    def __init__(self,name):

        self.name = name

class Pool:

    def __init__(self,count=3):

        self.count = count

        #连接池容器

        self.pool = [self._connect('conn-{}'.format(x)) for x in range(self.count)]

    def _connect(self,conn_name):

        return Conn(conn_name)

    def get_conn(self):

        # if len(self.pool) > 0:

        return self.pool.pop() #从尾部弹出一个

    def return_conn(self,conn:Conn):

        self.pool.append(conn)

pool = Pool(3)

print(pool.pool)

pool.get_conn()

pool.get_conn()

pool.get_conn()

pool.get_conn() #第4个

print('End Main')

运行结果：

[<__main__.Conn object at 0x00000211BBEBC160>, <__main__.Conn object at 0x00000211BBEBC1D0>, <__main__.Conn object at 0x00000211BBEBC240>]

Traceback (most recent call last):

  File "C:/python/test.py", line 34, in <module>

    pool.get_conn()

  File "C:/python/test.py", line 24, in get_conn

    return self.pool.pop() #从尾部弹出一个

IndexError: pop from empty list

　　当连接池中已经没有可用连接时，再获取就会抛异常 IndexError：pop from empty list。

那就加个判断，只在池中连接数量大于0的时候才可以获取连接：

#修改get_conn函数

    def get_conn(self):

        if len(self.pool) > 0:

            return self.pool.pop() #从尾部弹出一个

　　这样在连接池为空时，就不会抛异常了。

这个连接池的例子如果使用多线程，这个get_conn()方法是线程不安全的，有可能其它线程看到池中还有一个连接，正准备获取，其它线程也看到了，也准备获取连接，就会抛异常。再或者，都在向池中加连接的时候，也可能会多加。

这个问题可以用锁Lock来解决，在获取连接和加连接时，加锁解锁；也可以使用semaphore信号量来解决。

使用信号量对上例进行修改：

#使用semaphore信号量修改连接池

import threading,logging,time,random

DATEFMT="%H:%M:%S"

FORMAT = "[%(asctime)s]\t [%(threadName)s,%(thread)d] %(message)s"

logging.basicConfig(level=logging.INFO,format=FORMAT,datefmt=DATEFMT)

class Conn:

    def __init__(self,name):

        self.name = name

    def __repr__(self):

        return self.name

class Pool:

    def __init__(self,count=3):

        self.count = count

        #连接池容器

        self.pool = [self._connect('conn-{}'.format(x)) for x in range(self.count)]

        self.semaphore = threading.Semaphore(self.count)

    def _connect(self,conn_name):

        #返回一个连接名

        return Conn(conn_name)

    def get_conn(self):

        #从池中拿走一个连接

        # if len(self.pool) > 0:

        self.semaphore.acquire(timeout=5) #-1,获取连接,最大5秒超时时间,与后面随机秒数相对应

        data = self.pool.pop() #从尾部弹出一个

        return data

    def return_conn(self,conn:Conn):

        #向池中添加一个连接

        self.pool.append(conn)

        self.semaphore.release()  # 先加入池中再信号量+1

        return len(self.pool)

pool = Pool(3)

def worker(pool:Pool):

    conn = pool.get_conn()

    logging.info(conn)

    #模拟使用了资源一段时间(随机1-4秒),然后归还

    threading.Event().wait(timeout=random.randint(1,4))

    pool.return_conn(conn)

for i in range(6):

    threading.Thread(target=worker,name="worker-{}".format(i),args=(pool,)).start()

print('End Main')

运行结果：

[10:34:12]	 [worker-0,5264] conn-2

[10:34:12]	 [worker-1,7420] conn-1

[10:34:12]	 [worker-2,2612] conn-0

End Main

[10:34:13]	 [worker-3,3972] conn-1 #归还以后又可以获取连接

[10:34:14]	 [worker-4,8172] conn-2

[10:34:15]	 [worker-5,11192] conn-1

　　上例中模拟获取连接以后使用了1-4秒钟，没有拿到资源的最多阻塞5秒钟，当连接使用结束归还后，阻塞的线程就又重新获取到连接。

问题：

1) 没有使用信号量就release的情况：

import threading

s = threading.Semaphore(3)

print(s.__dict__)

def work(s:threading.Semaphore):

    s.release()

for i in range(3):

    threading.Thread(target=work,args=(s,)).start()

    print(s.__dict__)

运行结果：

{'_cond': <Condition(<unlocked _thread.lock object at 0x00000219202973A0>, 0)>, '_value': 3}

{'_cond': <Condition(<unlocked _thread.lock object at 0x00000219202973A0>, 0)>, '_value': 2}

{'_cond': <Condition(<unlocked _thread.lock object at 0x00000219202973A0>, 0)>, '_value': 3}

{'_cond': <Condition(<unlocked _thread.lock object at 0x00000219202973A0>, 0)>, '_value': 4}

{'_cond': <Condition(<unlocked _thread.lock object at 0x00000219202973A0>, 0)>, '_value': 5}

　　没有acquire信号量时，就release的情况，结果导致了信号量的内置倒计数器的值增加，这样就超出了最大值。

解决办法：

使用BoundedSemaphore类：

BoundedSemaphore，继承自Semaphore类。边界绑定，有界的信号量，不允许使用release超过初始值的范围，否则，抛ValueError异常。

#BoundedSemaphore边界绑定

import threading

s = threading.BoundedSemaphore(3)

print(s.__dict__)

s.acquire()

print(s.__dict__)

def work(s:threading.BoundedSemaphore):

    s.release()

for i in range(3):

    threading.Thread(target=work,args=(s,)).start()

    print(s.__dict__)

运行结果：

{'_value': 3, '_cond': <Condition(<unlocked _thread.lock object at 0x000001A42DDF73A0>, 0)>, '_initial_value': 3}

{'_value': 2, '_cond': <Condition(<unlocked _thread.lock object at 0x000001A42DDF73A0>, 0)>, '_initial_value': 3}

{'_value': 3, '_cond': <Condition(<unlocked _thread.lock object at 0x000001A42DDF73A0>, 0)>, '_initial_value': 3}

{'_value': 3, '_cond': <Condition(<unlocked _thread.lock object at 0x000001A42DDF73A0>, 0)>, '_initial_value': 3}

{'_value': 3, '_cond': <Condition(<unlocked _thread.lock object at 0x000001A42DDF73A0>, 0)>, '_initial_value': 3}

Exception in thread Thread-2:

Traceback (most recent call last):

  File "C:/python/test.py", line 11, in work

    s.release()

ValueError: Semaphore released too many times

　　使用BoundedSemaphore就可以控制不会多归还。

[Python 多线程] Semaphore、BounedeSemaphore (十二)的更多相关文章

Python 多线程、多进程（二）之多线程、同步、通信
Python 多线程.多进程 (一)之源码执行流程.GIL Python 多线程.多进程 (二)之多线程.同步.通信 Python 多线程.多进程 (三)之线程进程对比.多线程一.python ...
Python开发【第二十二篇】：Web框架之Django【进阶】
Python开发[第二十二篇]:Web框架之Django[进阶] 猛击这里:http://www.cnblogs.com/wupeiqi/articles/5246483.html 博客园首页 ...
java 轻量级同步volatile关键字简介与可见性有序性与synchronized区别多线程中篇（十二）
概念 JMM规范解决了线程安全的问题,主要三个方面:原子性.可见性.有序性,借助于synchronized关键字体现,可以有效地保障线程安全(前提是你正确运用) 之前说过,这三个特性并不一定需要全部同 ...
“全栈2019”Java多线程第三十二章：显式锁Lock等待唤醒机制详解
难度初级学习时间 10分钟适合人群零基础开发语言 Java 开发环境 JDK v11 IntelliJ IDEA v2018.3 文章原文链接 "全栈2019"Java多 ...
python自动华（十二）
Python自动化 [第十二篇]:Python进阶-MySQL和ORM 本节内容数据库介绍 mysql 数据库安装使用 mysql管理 mysql 数据类型常用mysql命令创建数据库外键增 ...
Java 多线程基础（十二）生产者与消费者
Java 多线程基础(十二)生产者与消费者一.生产者与消费者模型生产者与消费者问题是个非常典型的多线程问题,涉及到的对象包括“生产者”.“消费者”.“仓库”和“产品”.他们之间的关系如下: ①.生 ...
python运维开发(十二)----rabbitMQ、pymysql、SQLAlchemy
内容目录: rabbitMQ python操作mysql,pymysql模块 Python ORM框架,SQLAchemy模块 Paramiko 其他with上下文切换 rabbitMQ Rabbit ...
Python学习【第十二篇】模块(2)
序列化 1.什么是python序列化? 把变量从内存中变成可存储或传输的过程称之为序列化,在Python中叫pickling 序列化就是将python的数据类型转换成字符串反序列化就是将字符串转换成 ...
Python 多线程进程高级指南(二)
本文是如何<优雅地实现Python通用多线程/进程并行模块>的后续.因为我发现,自认为懂了一点多线程开发的皮毛,写了那么个multi_helper的玩意儿,后来才发现我靠原来就是一坨屎.自 ...

随机推荐

Apache Rewrite的主要功能
#Apache Rewrite的主要功能 #就是实现URL的跳转和隐藏真实地址,基于Perl语言的正则表达式规范.#平时帮助我们实现拟静态,拟目录,域名跳转,防止盗链等在httpd.conf中去掉下 ...
IntelliJ IDEA 2018.3 安装+永久激活[Windows]
IntelliJ IDEA 作为一个优秀的Java开发环境,深受许多开发者喜爱,但是它的价格却贵得让人无法接受,这篇文章将介绍永久激活IntelliJ IDEA的方法(使用破解补丁). 系统环境:Wi ...
js中获取css样式的两种方式
1. 对象.style.样式名弊端就是只能获取行内样式 2.window.getComputedStyle(对象,null); 最好用第二种方式 <!DOCTYPE html> < ...
一类划分关系和指数级生成函数，多项式exp的关系
划分关系姑且这么叫着设满足性质 \(A\) 的集合为 \(S_A\),每个元素有标号如果 \(S_B\) 是由若干个 \(S_A\) 组成的一个大集合设 \(a_i\) 表示大小为 \(i\) ...
BZOJ4568: [Scoi2016]幸运数字(线性基倍增)
题意题目链接 Sol 线性基是可以合并的倍增维护一下然后就做完了?? 喵喵喵? // luogu-judger-enable-o2 #include<bits/stdc++.h> # ...
Typescript中一些不理解的概念解释（泛型、断言、解构、枚举）
新的项目想使用typescript,因此又对其概念及使用过一遍,本文主要记录下对之前一些概念不太理解的地方. 1.泛型定义: 在定义函数.接口或者类的时候,不预先指定具体的类型,而是在使用的时候再指 ...
移动前端调试页面–weinre
安装 npm install -g weinre 启动 weinre --boundHost -all- 浏览器查看 http://localhost:8080 插入相关文件 index.html d ...
springboot中filter的配置和顺序执行
项目结构 springboot版本 <parent> <groupId>org.springframework.boot</groupId> <artifac ...
SVG坐标系统和transformation彻底理解
翻译自https://sarasoueidan.com/blog/svg-coordinate-systems/ SVG元素不像传统的HTML elements一样受制于css box model.这 ...
--Too small initial heap for new size specified
虽然Java屏蔽了一下内存细节,但是有时候,了解一下这些常识还是有好处的,特别是一些面试,总是盯着这些玩意不放手. JVM启动以后,会分配两类内存区域,一类用于开发人员使用,比如保存一些变量,对象等, ...

[Python 多线程] Semaphore、BounedeSemaphore (十二)

[Python 多线程] Semaphore、BounedeSemaphore (十二)的更多相关文章

随机推荐

热门专题