0% found this document useful (0 votes)

50 views

Anti Virus 2.0 "Compilers in Disguise": Mihai G. Chiriac Bitdefender

The document discusses optimizing emulator performance for antivirus scanning. It begins by discussing the history of antivirus techniques and challenges with emulation. It then proposes using an intermediate language and optimizations like static single assignment form to improve performance of decryption loop emulation. The document suggests three execution modes: simulating micro-operations directly, linking micro-function simulations, or generating target-specific machine code. Faster execution speeds can be achieved by combining multiple micro-operations or generating native code.

Uploaded by

admiral9hacker

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views

Anti Virus 2.0 "Compilers in Disguise": Mihai G. Chiriac Bitdefender

Uploaded by

admiral9hacker

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 45

Anti Virus 2.

0
“Compilers in disguise”
Mihai G. Chiriac
BitDefender
Talk outline
• AV History
• Emulation basics
• Compiler technology
– Intermediate Language
– Optimizations
– Code generation
• Conclusions
AV History!
• String searching
– Aho-Corasick, KMP, Boyer-Moore
– PolySearch
– Bookmarks (from top, tail, EP?)
– Hashes (B, ofs1, sz1, crc1, ofs2, sz2, crc2)

NYB.A
AV History (cont’d)
• Encrypted viruses
– Static decryption loop (signature)
– Simple encryption (xray-ing)
– Algorithmic detection

Cascade.1706 decryption loop

AV History (cont’d)
• Polymorphic viruses
– Simple encryption (xray-ing)
– Algorithmic detection, heuristics

TPE.Giraffe.A
Emulation!
• Hardware
– Virtual CPU
– Virtual memory
– Virtual devices
• Software
– Partial OS simulation
• Bonus Goodies
– Fake IRC, SMTP, DNS, etc servers
Workflow
• Init the CPU / VM
• Init the virtual OS
– Modules
– Structures
• Map the file
• Start emulation from cs:eip
• Scan (when conditions are met)
• Quit (when conditions are met)
Sample

Win32.Parite (Pinfi) decryption loop

Ready to emulate our first
instruction?
…Not yet…
Chores! 
• Pre-instruction tasks
– DRx handling
– Segment access rights
– Page access rights
• Post-instruction tasks
– TF handling
– Update the virtual EIP
– Update the EI number
Tasks
• Fetch instruction from cs:eip
• Decode
– Handle prefixes

• Emulate!
• Easy, huh? 
On average,
one every three instructions
references memory…

Memory accesses
• We have to virtualize the entire 4 GB
space….
• Every memory access needs:
– Segment access checks
– Linear address computation
– Page access checks
– Hardware debugging checks
– SMC checks!! (for memory writes)
Problems…
• Millions of instructions…
• Polymorphic decryption loops are full of
do-nothing, “garbage” code…
• Decompression loops are optimized for
size, not speed…

• …This results in unacceptable

performance …
600000

200000
400000
Parses 800000

0
1000000
1200000
1400000
1600000

0x00565577 ->…
0x005D1EF2 ->…
0x005D1F04 ->…
0x005D1F1C ->…
0x005D1F33 ->…
0x005D1F4F ->…
0x005D1F61 ->…
0x005D1F74 ->…
0x005D1F9A ->…
UPX decompression

0x005D1FC3 ->…
0x005D1FF6 ->…
0x005D201F ->…
0x7FF00C0E ->…
0x7FF80430 ->…
0x7FF80498 ->…
0x7FF804E8 ->…
0x7FF80521 ->…
0x7FF8055D ->…
Advantages
• Typically, an emulator spends the most
time in loops…

• A small percentage of code is responsible

for a large percentage of emulation time…

• So… we know what to optimize!

The plan
• Identify hot-spots
– Basic blocks that execute very frequently

• Try to make them run as fast as possible

– Reducing to a minimum the set of repetitive
actions
– Reducing to a minimum the number of
reduntant operations
Back to our code…
• .420010 (31 1C 3E) xor [esi+edi], ebx
First thoughts
• For loops, keep the opcodes already
decoded!
• Memory model is usually flat…
– We can catch accesses to DS, SS,…
• Hardware debugging rarely used…
– We can catch accesses to DRx
• Trap Flag rarely used…
– We can monitor accesses to EFlags
Back to our code…
• .420010 (31 1C 3E) xor [esi+edi], ebx
But we can do much more!
• x86 - Very rich instruction set
– One instruction – many basic operations
– Different encodings, same result
– Hard(er) to optimize…

• Mike’s Intermediate Language Format

• …apparently the acronym is taken 
IL Basics
• Very RISC-like
• Single-purpose micro-operations
• Infinite number of virtual registers
• Many info, useful for optimizations
– Operation type, operands
– Input / output variables (use-define)
• Many info, useful for dynamic analysis
– Memory access info
Parite.A decryption (1)
• Decrypt:
• .420010 xor [esi+edi], ebx
• .420013 sub esi, 2
• .420016 sub esi, 2
• .420019 jnz Decrypt

Compute_ZF (tm1)
mm0 = esi + edi Compute_SF (tm1)
tm0 = load32 (mm0) Compute_PF (tm1)
tm1 = tm0 ^ ebx
store32 (mm0, tm1) Compute_OF (OP_XOR, …)
Compute_AF (OP_XOR, …)
Compute_CF (OP_XOR, …)
Parite.A decryption (2)
• Decrypt:
• .420010 xor [esi+edi], ebx
• .420013 sub esi, 2
• .420016 sub esi, 2
• .420019 jnz Decrypt

Compute_ZF (esi)
Compute_SF (esi)
tm0 = esi Compute_PF (esi)
esi = esi – 2
Compute_OF (OP_SUB, …)
Compute_AF (OP_SUB, …)
Compute_CF (OP_SUB, …)
Parite.A decryption (3)
• Decrypt:
• .420010 xor [esi+edi], ebx
• .420013 sub esi, 2
• .420016 sub esi, 2
• .420019 jnz Decrypt

Compute_ZF (esi)
Compute_SF (esi)
tm0 = esi Compute_PF (esi)
esi = esi – 2
Compute_OF (OP_SUB, …)
Compute_AF (OP_SUB, …)
Compute_CF (OP_SUB, …)
Parite.A decryption (4)

• We can follow the use-def chains and

remove unnecessary micro-ops…

mm0 = esi + edi Compute_ZF (esi)

tm0 = load32 (mm0) Compute_SF (esi)
tm1 = tm0 ^ ebx Compute_PF (esi)
store32 (mm0, tm1)
esi = esi – 2 Compute_OF (OP_SUB, …)
tm0 = esi Compute_AF (OP_SUB, …)
esi = esi – 2 Compute_CF (OP_SUB, …)
Parite.A decryption (5)

• We can compute some values only if

really needed…

mm0 = esi + edi

tm0 = load32 (mm0)
tm1 = tm0 ^ ebx Set_LazyFlags (OP_SUB, …)
store32 (mm0, tm1) Compute_ZF (esi)
esi = esi – 2
tm0 = esi
esi = esi – 2
Static single assignment
Sample code… Three-address code..

int a, b, c; int a, b, c;

a = 5; a = 5;
b = 3; b = 3;
c = a + b + 3; c = a + b;
b = c + 1; c = c + 3;
b = c + 1;
SSA (cont’d)
Three-address code… SSA Form

a = 5; a[0] = cnst(5)
b = 3; b[0] = cnst(3)
c = a + b; c[0] = a[0]+b[0]
c = c + 3; c[1] = c[0]+cnst(3)
b = c + 1; b[1] = c[1]+cnst(1)

Easy! Create a different version for every variable state!

SSA (cont’d)
SSA Form… Graph!
b[1]
+
a[0] = cnst(5) / \
b[0] = cnst(3) c[1] cnst (1)
c[0] = a[0]+b[0] /
+
c[1] = c[0]+cnst(3) / \
b[1] = c[1]+cnst(1) c[0] cnst (3)
+
/ \
a[0] b[0]
SSA (cont’d)
• Very simple optimization
b[1]
framework +
– Constant folding / \
c[1] cnst (1)
– Constant propagation /
+
– Common sub-expression / \
c[0] cnst (3)
elimination +
/ \
– Dead code removal a[0] b[0]

• Expensive, so it’s used

only when needed…
Memory!
0040517E 812B 84F1183C SUB DWORD PTR DS:[EBX],3C18F184
00405184 832B 96 SUB DWORD PTR DS:[EBX],-6A
00405187 013B ADD DWORD PTR DS:[EBX],EDI
00405189 D1CF ROR EDI,1
0040518D 832B DF SUB DWORD PTR DS:[EBX],-21
00405190 812B 69802E61 SUB DWORD PTR DS:[EBX],612E8069
00405196 29C9 SUB ECX,ECX
00405198 812B CD05B390 SUB DWORD PTR DS:[EBX],90B305CD
0040519E 832B 79 SUB DWORD PTR DS:[EBX],79
004051A3 87C1 XCHG ECX,EAX
004051A5 29D1 SUB ECX,EDX
004051A7 832B C9 SUB DWORD PTR DS:[EBX],-37
004051AE 2933 SUB DWORD PTR DS:[EBX],ESI
…

Win32.Harrier decryption loop (partial)

Challenges
• Memory locations = variables, but…
– Hard to prove the addresses are valid…
– Problems with pointer aliasing (including
ESP!!)

• A possible solution
– Perform these optimizations only after we’ve
gathered a set of run-time data…
Execution modes – 1
• No code generation! 
• Simply simulate the micro-ops
• Advantages:
– Very portable
– Easy to profile
– Easy to debug
• Disadvantages:
– Slow 
PSP 
Execution modes – 2
• Trivial code generation…

Untitled
No ratings yet
Untitled
207 pages
Answers 2 Reviews and Exercises
No ratings yet
Answers 2 Reviews and Exercises
26 pages
Core Security Introduction To Software Vulnerability Exploitation
No ratings yet
Core Security Introduction To Software Vulnerability Exploitation
74 pages
Integers Floating Point: N N S E
No ratings yet
Integers Floating Point: N N S E
4 pages
Analysis and Visualization of Common Packers
No ratings yet
Analysis and Visualization of Common Packers
53 pages
x86 Architecture - Windows drivers _ Microsoft Learn
No ratings yet
x86 Architecture - Windows drivers _ Microsoft Learn
13 pages
Lesson 2.1 - Intro + x86-x64 Assembly
No ratings yet
Lesson 2.1 - Intro + x86-x64 Assembly
33 pages
Intel Cheat Sheet
No ratings yet
Intel Cheat Sheet
8 pages
Referral Sheet Format
No ratings yet
Referral Sheet Format
2 pages
CSO Cache Memory Numericals
No ratings yet
CSO Cache Memory Numericals
12 pages
Assembly #1
No ratings yet
Assembly #1
8 pages
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
No ratings yet
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
110 pages
A Crash Course On x86 Disassembly
No ratings yet
A Crash Course On x86 Disassembly
23 pages
lecture01-intro
No ratings yet
lecture01-intro
67 pages
Guide To Using Assembly in Visual Studio
100% (1)
Guide To Using Assembly in Visual Studio
7 pages
Armadillo v3 7-OEPFinder
No ratings yet
Armadillo v3 7-OEPFinder
9 pages
Assembly Language Crash Course
No ratings yet
Assembly Language Crash Course
7 pages
01 Lecture02
No ratings yet
01 Lecture02
78 pages
CSCI 232: Introduction To Assembly
No ratings yet
CSCI 232: Introduction To Assembly
59 pages
Encriptador y Desencriptador en Ensamblador
No ratings yet
Encriptador y Desencriptador en Ensamblador
19 pages
Lab Manual Coal
No ratings yet
Lab Manual Coal
15 pages
x86 Instructions - Windows drivers _ Microsoft Learn
No ratings yet
x86 Instructions - Windows drivers _ Microsoft Learn
14 pages
Opcodes Support: 6.1 Generated Files
No ratings yet
Opcodes Support: 6.1 Generated Files
4 pages
Reversing Basics - A Practical Approach: Author: Amit Malik (Double - Zer0) E-Mail
No ratings yet
Reversing Basics - A Practical Approach: Author: Amit Malik (Double - Zer0) E-Mail
9 pages
ch4 Handouts
No ratings yet
ch4 Handouts
72 pages
Compiler Design Code Generation
No ratings yet
Compiler Design Code Generation
4 pages
8086 Instruction Set
No ratings yet
8086 Instruction Set
66 pages
Class04 X86assembly
No ratings yet
Class04 X86assembly
44 pages
CS 4740/6740 Network Security: Lecture 7: Memory Corruption (Assembly Review, Basic Exploits)
No ratings yet
CS 4740/6740 Network Security: Lecture 7: Memory Corruption (Assembly Review, Basic Exploits)
189 pages
OS Structure
No ratings yet
OS Structure
23 pages
IntroductionToIntelx86 Part1 PDF
No ratings yet
IntroductionToIntelx86 Part1 PDF
113 pages
Unit II: Instruction Set and Addressing Modes
No ratings yet
Unit II: Instruction Set and Addressing Modes
53 pages
6800 Instruction Set
No ratings yet
6800 Instruction Set
5 pages
X86 Opcode Reference 64-Bit Edition: General, System, x87 FPU, MMX, SSE (1), SSE2, SSE3, SSSE3 Opcodes
No ratings yet
X86 Opcode Reference 64-Bit Edition: General, System, x87 FPU, MMX, SSE (1), SSE2, SSE3, SSSE3 Opcodes
4 pages
Assembly Paper Key
No ratings yet
Assembly Paper Key
7 pages
Week 3 - Lecture
No ratings yet
Week 3 - Lecture
68 pages
Intel386 psABI 1.1
No ratings yet
Intel386 psABI 1.1
64 pages
Appendix A: 8085 Instruction Set Instructions Op Code Flags Main Effects
No ratings yet
Appendix A: 8085 Instruction Set Instructions Op Code Flags Main Effects
6 pages
8255 Sinusoidal Wave Alp
No ratings yet
8255 Sinusoidal Wave Alp
37 pages
branch instructions
No ratings yet
branch instructions
6 pages
Introduction To Intel x86 Assembly, Architecture, Applications, & Alliteration
No ratings yet
Introduction To Intel x86 Assembly, Architecture, Applications, & Alliteration
113 pages
7. COMS1015 Low Level Programming
No ratings yet
7. COMS1015 Low Level Programming
45 pages
pointers - com sci
No ratings yet
pointers - com sci
9 pages
Undocumented Cpu Behavior
No ratings yet
Undocumented Cpu Behavior
50 pages
1964 Recompiling Engine Documentation Documentation
No ratings yet
1964 Recompiling Engine Documentation Documentation
16 pages
Introduction-to-x86-Architecture
No ratings yet
Introduction-to-x86-Architecture
31 pages
Lec13 X86asm
No ratings yet
Lec13 X86asm
71 pages
8051 Basic Programs (Using Address)
No ratings yet
8051 Basic Programs (Using Address)
14 pages
DucHuy_CA_Lab2_2021
No ratings yet
DucHuy_CA_Lab2_2021
25 pages
I221154 F A2 Coal Merged
100% (1)
I221154 F A2 Coal Merged
17 pages
Advance Microprocessor
No ratings yet
Advance Microprocessor
53 pages
Computer Architecture: Assoc. Prof. Nguyễn Trí Thành, Phd
No ratings yet
Computer Architecture: Assoc. Prof. Nguyễn Trí Thành, Phd
126 pages
x86 Assembly
No ratings yet
x86 Assembly
17 pages
Ece4750 T01 Proc Concepts Problems
No ratings yet
Ece4750 T01 Proc Concepts Problems
10 pages
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
From Everand
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
Bruce Dang
No ratings yet
Foundation Course for Advanced Computer Studies
From Everand
Foundation Course for Advanced Computer Studies
Franck Ismael Djédjé
No ratings yet
Blowfish Cipher Tutorials - Herong's Tutorial Examples
From Everand
Blowfish Cipher Tutorials - Herong's Tutorial Examples
Herong Yang
No ratings yet
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
MARIO FRANCO
No ratings yet
Lisp Programming Language
From Everand
Lisp Programming Language
Faiz ul haque Zeya
No ratings yet
Amazing Java: Learn Java Quickly
From Everand
Amazing Java: Learn Java Quickly
Andrei Besedin
No ratings yet
LPIC-1 Primer
From Everand
LPIC-1 Primer
John Greene
4.5/5 (3)
Project Proposal: Smart Health Card
100% (1)
Project Proposal: Smart Health Card
7 pages
Compiler Mcqs (Org)
No ratings yet
Compiler Mcqs (Org)
39 pages
Penn Mutual Case Study Final
No ratings yet
Penn Mutual Case Study Final
6 pages
How To Send Email in WordPress Using The Gmail SMTP Server
No ratings yet
How To Send Email in WordPress Using The Gmail SMTP Server
1 page
Assosa University: Collage of Computing and Informatics Department of Information Technology
No ratings yet
Assosa University: Collage of Computing and Informatics Department of Information Technology
11 pages
CHAPTER 4 Review Answers
No ratings yet
CHAPTER 4 Review Answers
4 pages
Net REVEAL Job Descripton PDF
No ratings yet
Net REVEAL Job Descripton PDF
3 pages
New Download Links
No ratings yet
New Download Links
14 pages
A1 Yash OS
No ratings yet
A1 Yash OS
9 pages
Text To Speech Converter Using Javascript: Madhav Institute of Technology & Science Gwalior
No ratings yet
Text To Speech Converter Using Javascript: Madhav Institute of Technology & Science Gwalior
3 pages
16. 2673267 - OpenODS View on CDS View
No ratings yet
16. 2673267 - OpenODS View on CDS View
2 pages
Write A Shell Script To Create A File in
No ratings yet
Write A Shell Script To Create A File in
8 pages
How To Use Efris Web Service Api: July 2, 2020
0% (1)
How To Use Efris Web Service Api: July 2, 2020
7 pages
Gigabyte P15 User Manual
No ratings yet
Gigabyte P15 User Manual
9 pages
SAP FS00 - Create General Ledger Account Centrally
100% (2)
SAP FS00 - Create General Ledger Account Centrally
20 pages
Getting Started With Passport 4400 and 6400/7400 Interworking
No ratings yet
Getting Started With Passport 4400 and 6400/7400 Interworking
44 pages
Sas Ques
No ratings yet
Sas Ques
63 pages
ISPF Table Example
No ratings yet
ISPF Table Example
31 pages
NI Mechatronics Machine Design Guide
No ratings yet
NI Mechatronics Machine Design Guide
46 pages
C++ Classes, Member Functions (Getters-Setters, Accessors-Mutators)
No ratings yet
C++ Classes, Member Functions (Getters-Setters, Accessors-Mutators)
24 pages
Recommendation For Sizing The Catalog
No ratings yet
Recommendation For Sizing The Catalog
4 pages
Angular Best Practices 20180412
No ratings yet
Angular Best Practices 20180412
33 pages
Rational Rose
No ratings yet
Rational Rose
12 pages
Leadership in Mobile Industry
No ratings yet
Leadership in Mobile Industry
30 pages
Shopping Cart
No ratings yet
Shopping Cart
152 pages
How I Cracked The AWS Solution Architect Cloud Quest. - DEV Community
No ratings yet
How I Cracked The AWS Solution Architect Cloud Quest. - DEV Community
8 pages
Speaking Worksheet: Don'T Be A Victim To Online Scams!
No ratings yet
Speaking Worksheet: Don'T Be A Victim To Online Scams!
1 page
DNV Leak 3.3 - Download Free Software PDF
No ratings yet
DNV Leak 3.3 - Download Free Software PDF
5 pages
Network Infrastructure Auditing Checklist
100% (1)
Network Infrastructure Auditing Checklist
2 pages

Anti Virus 2.0 "Compilers in Disguise": Mihai G. Chiriac Bitdefender

Uploaded by

Anti Virus 2.0 "Compilers in Disguise": Mihai G. Chiriac Bitdefender

Uploaded by

Anti Virus 2.

Cascade.1706 decryption loop

Win32.Parite (Pinfi) decryption loop

• …This results in unacceptable

• A small percentage of code is responsible

• So… we know what to optimize!

• Try to make them run as fast as possible

• Mike’s Intermediate Language Format

• We can follow the use-def chains and

mm0 = esi + edi Compute_ZF (esi)

• We can compute some values only if

mm0 = esi + edi

Easy! Create a different version for every variable state!

• Expensive, so it’s used

Win32.Harrier decryption loop (partial)

• Simply link the micro-functions that

You might also like