How Uuencoding Works
做题目学习 https://www.zhihu.com/question/26598476/answer/45396765
http://email.about.com/od/emailbehindthescenes/a/uuencoding.htm
It begins with "begin". It ends with "end". In between, however, are lots and lots of random characters. Or so it seems.
Of course, the characters you see in the email attachment are not random.
They are uuencoded.
Uuencoding is a method of representing arbitrary任意的;武断的;专制的 binary data (such as programs, word processor documents, or images) in plain US-ASCII text.
Why Uuencode? 【一种编码的程式(可将二进制文件编码为文本文件)】
Why would anybody need a way for doing that?
Email was designed for textual messages using only English characters and cannot safely transport text in other languages let alone更不必说 binary files such as images or programs.
When you insert the latter in an email you never know what will come out at the recipient's接受者 end, or if anything will come out at all.
This is why a way to convert binary data to ASCII text (and vice versa反之亦然) was needed.
Uuencoding is one way, and in a sense a predecessor to MIME and its predominant Base64 encoding.
What Uuencoding Does
Like Base64 encoding, uuencoding takes three bytes of binary data and converts them to four bytes of ASCII text (which equals four characters).
每3个字节,拆成对应的4个ASCII文本 8*3拆为6*4
Each byte consists of eight bits.
For the conversion, uuencoding starts by concatenating连结;使连锁 the 24 bits (3 times 8) that make up the three binary data bytes and splitting them into four chunks of six bits each.
To get four complete bytes (which need eight bits but we only have six), '0' characters need to be placed in front of these chunks. 需要补0
We do not want to change the value of the chunks, 007 has the same value has 7.
So far, we have turned three bytes into four bytes by adding superfluous多余的 data and wasting space.
This is necessary to ensure each byte can be represented in ASCII characters.
Another step is necessary before we can begin that conversion, however.
Nonchalantly冷淡地,漠不关心地, we add 32 (100000 in binary notation记号法) to each of the four bytes. 拆出的4个字节,每个+32
This makes sure that the output data is indeed a printable character, which begin with ASCII 32, the white space.
To get the character, we look up the byte value in the ASCII codeset (65, for example, corresponds to 'A').
Now we have a way to transform binary data to ASCII text.
To make it ready for inclusion包含;内含物 in email messages, one more hurdle障碍 is waiting for us to be taken.
While there is no limit to the size of an email message, email servers can and do limit the length of a single line in an email.
We need to insert line breaks into our output data.
This is done after at most 45 characters. 每一行最多45个字符
Since the last line will not always be filled with 45 characters and to introduce a certain amount of data security, each line begins with a character indicating the length of the line. 每一行开头的字符指明了长度
To get this character, we apply the procedure that already got us ASCII characters for our binary bits to the line length: we add 32 and look up the character in the ASCII table. 把长度+32然后转换成对应的ASCII码
If the line is 45 bytes long, for example, we look up 77, which is 'M'. This is why you will find a capital M at the beginning of most lines in a uuencoded file. 长度45,加上32,得到77,对应M
Encoding Example
Let us assume we have three bytes of input: 155, 162 and 233.
The corresponding bit stream is 100110111010001011101001,
which in turn corresponds to the 6-bit values 100110, 111010, 001011 and 101001.
Now we turn these six-bit chunks into full bytes (00100110 = 38, 00111010 = 58, 00001011 = 11 and 00101001 = 41)
and add 32 to get 70 = 01000110, 90 = 01011010, 43 = 00101011 and 73 = 01001001.
Eventually, we look up these numbers in the ASCII table: 70 = F, 90 = Z, 43 = + and 73 = I.
Our input stream converts to the printable characters FZ+I.
FZ+I is 4 bytes long, plus 32 makes 36, which translates to '$'. So the full line of uuencoded data is $FZ+I.
Header and Footer
Only the "begin" and "end" are still missing.
The header line of a uuencoded file consists of the word "begin", the Unix file permission value (something like "664") and the file name.
A header line could look like this, for example: "begin 664 test.bin".
The footer is simply "end". A complete example, thus, would be:
begin 664 test.bin
$FZ+I
end
How Uuencoding Works的更多相关文章
- spring注解源码分析--how does autowired works?
1. 背景 注解可以减少代码的开发量,spring提供了丰富的注解功能.我们可能会被问到,spring的注解到底是什么触发的呢?今天以spring最常使用的一个注解autowired来跟踪代码,进行d ...
- [Unity][Heap sort]用Unity动态演示堆排序的过程(How Heap Sort Works)
[Unity][Heap sort]用Unity动态演示堆排序的过程 How Heap Sort Works 最近做了一个用Unity3D动态演示堆排序过程的程序. I've made this ap ...
- How PhoneGap & Titanium Works
转载自 http://www.appcelerator.com/blog/2012/05/comparing-titanium-and-phonegap/ How PhoneGap Works As ...
- Saying that Java is nice because it works on every OS is like saying that anal sex is nice because it works on every gender.
Saying that Java is nice because it works on every OS is like saying that anal sex is nice because i ...
- How Garbage Collection Really Works
Java Memory Management, with its built-in garbage collection, is one of the language's finest achiev ...
- 攻城狮在路上(肆)How tomcat works(零) 前言说明
最近几篇是关于How tomcat works一书的读书笔记. 通过数个章节逐渐实现一个tomcat的功能. 源码下载地址:http://zhidao.baidu.com/share/7007af0f ...
- The Magic only works with total devotion of one's heart
The Magic only works with total devotion of one's heart All tools and equipments are useless without ...
- How Tomcat works — 四、tomcat启动(3)
上一节说到StandardService负责启动其子组件:container和connector,不过注意,是有先后顺序的,先启动container,再启动connector,这一节先来看看conta ...
- How ADB works
ADB (Android Debug Bridge): How it works? 2012.2.6 early draft Tetsuyuki Kobayashi What is ADB? If y ...
随机推荐
- shell编程之数学运算
shell数学运算支持整数运算的四种方法 1.let命令 no1=4; no2=5; let result=no1+no2 2.[]操作符 result=$[ no1 + no2] 3.(())操作符 ...
- setrendertraget 上下颠倒
这个问题遇到两次了 之前一次是粒子rendertotexture 没设viewprot的时候是上下颠倒的 设置viewport之后就好了 现在在一个setrendertarget的地方又遇到了 上下颠 ...
- FormCreate & FormActivate & FormShow执行顺序演示
procedure TForm1.FormCreate(Sender: TObject);begin form1.Caption:=form1.Caption +'+Create'; end; pr ...
- WinHex分析PE格式(1)
最近在一直努力学习破解,但是发现我的基础太差了,就想学习一下PE结构.可是PE结构里的结构关系太复杂,看这老罗的WiN32汇编最后一章 翻两页又合上了..把自己的信心都搞没了.感觉自己的理解能力不行, ...
- 在C语言中嵌入汇编语言
TMS320C28x编译器允许在C程序中嵌入汇编指令,通过下面声明实现: asm(“assembler text”); 其中assembler text指汇编代码.asm指令一般用来处理C/C++语句 ...
- 较复杂js的书写格式
我们看较复杂的js程序最怕结构混乱,一个好的js书写结构,在很大程度上可以减缓阅读的障碍性. 我感觉一个良好的结构要有两点:一是要有一个统一的入口,这样就保证了程序的可阅读性:二是要能够灵活的设置参数 ...
- win8系统输入法设置
Windows 8系统自带微软拼音简捷输入法,无论是在Windows的开始屏幕新界面中还是Windows传统桌面里,按Shift键或者直接点击屏幕上的"中/英"标识即可切换中英文输 ...
- Java 并发同步器之CountDownLatch、CyclicBarrier
一.简介 1.CountDownLatch是一个同步计数器,构造时传入int参数,该参数就是计数器的初始值,每调用一次countDown()方法,计数器减1,计数器大于0 时,await()方法会阻塞 ...
- iOS应用间的跳转和传值
在第一个应用程序中info.plist设置 URL Identifier: 该字符串是你自定义的 URL scheme 的名字 注意: URL Schemes 是一个数组,允许应用定义多个 URL s ...
- lintcode:等价二叉树
等价二叉树 检查两棵二叉树是否等价.等价的意思是说,首先两棵二叉树必须拥有相同的结构,并且每个对应位置上的节点上的数都相等. 样例 1 1 / \ / \ 2 2 and 2 2 / / 4 4 就是 ...