CS 242
Outline Java
Language Overview
History and design goals
Virtual machine overview
Loader and initialization Linker and verifier Bytecode interpreter
Classes and Inheritance
John Mitchell
Object features Encapsulation Inheritance
Method lookup
four different bytecodes
Types and Subtyping
Reading: Chapter 13 Primitive and ref types Interfaces; arrays Exception hierarchy Subtype polymorphism and generic programming
Verifier analysis Implementation of generics Security
Buffer overflow Java sandbox Type safety and attacks
Origins of the language
James Gosling and others at Sun, 1990 - 95 Oak language for set-top box
small networked device with television display
graphics execution of simple programs communication between local program and remote site no expert programmer to deal with crash, etc.
Design Goals
Portability
Internet-wide distribution: PC, Unix, Mac
Reliability
Avoid program crashes and error messages
Safety
Programmer may be malicious
Internet application
simple language for writing programs that can be transmitted over network
Simplicity and familiarity
Appeal to average programmer; less complex than C++
Efficiency
Important but secondary
General design decisions
Simplicity
Almost everything is an object All objects on heap, accessed through pointers No functions, no multiple inheritance, no go to, no operator overloading, few automatic coercions
Java System
The Java programming language Compiler and run-time system
Programmer compiles code Compiled code transmitted on network Receiver executes on interpreter (JVM) Safety checks made before/during execution
Portability and network transfer
Bytecode interpreter on many platforms
Reliability and Safety
Typed source and typed bytecode language Run-time type and bounds checks Garbage collection
Library, including graphics, security, etc.
Large library made it easier for projects to adopt Java Interoperability
Provision for native methods
Java Release History
1995 (1.0) First public release 1997 (1.1) Inner classes 2001 (1.4) Assertions
Verify programmers understanding of code
Enhancements in JDK 5 (= Java 1.5)
Generics
Polymorphism and compile-time type safety (JSR 14) For iterating over collections and arrays (JSR 201) Automatic conversion between primitive, wrapper types (JSR 201) Enumerated types with arbitrary methods and fields (JSR 201) Puts argument lists into an array; variable-length argument lists Avoid qualifying static members with class names (JSR 201)
Enhanced for Loop
Autoboxing/Unboxing Typesafe Enums Varargs
2004 (1.5) Tiger
Generics, foreach, Autoboxing/Unboxing, Typesafe Enums, Varargs, Static Import, Annotations, concurrency utility library
http://java.sun.com/developer/technicalArticles/releases/j2se15/
Static Import
Improvements through Java Community Process
Enables tools to generate code from annotations (JSR 175) Concurrency utility library, led by Doug Lea (JSR-166)
Annotations (Metadata)
Outline
Objects in Java
Classes, encapsulation, inheritance
Language Terminology
Class, object - as in other languages Field data member Method - member function Static members - class fields and methods this - self Package - set of classes in shared namespace Native method - method written in another language, often C
Type system
Primitive types, interfaces, arrays, exceptions
Generics (added in Java 1.5)
Basics, wildcards,
Virtual machine
Loader, verifier, linker, interpreter Bytecodes for method lookup
Security issues
Java Classes and Objects
Syntax similar to C++ Object
has fields and methods is allocated on heap, not run-time stack accessible through reference (only ptr assignment) garbage collected
Point Class
class Point { private int x; protected void setX (int y) {x = y;} public int getX() {return x;} Point(int xval) {x = xval;} // constructor };
Dynamic lookup
Similar in behavior to other languages Static typing => more efficient than Smalltalk Dynamic linking, interfaces => slower than C++ Visibility similar to C++, but not exactly (later slide)
Object initialization
Java guarantees constructor call for each object
Memory allocated Constructor called to initialize memory Some interesting issues related to inheritance
Well discuss later
Garbage Collection and Finalize
Objects are garbage collected
No explicit free Avoids dangling pointers and resulting type errors
Problem
What if object has opened file or holds lock?
Cannot do this (would be bad C++ style anyway):
Obj* obj = (Obj*)malloc(sizeof(Obj));
Solution
finalize method, called by the garbage collector
Before space is reclaimed, or when virtual machine exits Space overflow is not really the right condition to trigger finalization when an object holds a lock...)
Static fields of class initialized at class load time
Talk about class loading later
Important convention: call super.finalize
Encapsulation and packages
Every field, method belongs to a class Every class is part of some package
Can be unnamed default package File declares which package code belongs to package class field method package class field method
Visibility and access
Four visibility distinctions
public, private, protected, package
Method can refer to
private members of class it belongs to non-private members of all classes in same package protected members of superclasses (in diff package) public members of classes in visible packages
Visibility determined by files system, etc. (outside language)
Qualified names (or use import)
java.lang.String.substring()
package class method
Inheritance
Similar to Smalltalk, C++ Subclass inherits from superclass
Single inheritance only (but Java has interfaces)
Example subclass
class ColorPoint extends Point { // Additional fields and methods private Color c; protected void setC (Color d) {c = d;} public Color getC() {return c;} // Define constructor ColorPoint(int xval, Color cval) { super(xval); // call Point constructor c = cval; } // initialize ColorPoint field };
Some additional features
Conventions regarding super in constructor and finalize methods Final classes and methods
Class Object
Every class extends another class
Superclass is Object if no other class named
Constructors and Super
Java guarantees constructor call for each object This must be preserved by inheritance
Subclass constructor must call super constructor
If first statement is not call to super, then call super() inserted automatically by compiler If superclass does not have a constructor with no args, then this causes compiler error (yuck) Exception to rule: if one constructor invokes another, then it is responsibility of second constructor to call super, e.g., ColorPoint() { ColorPoint(0,blue);} is compiled without inserting call to super
Methods of class Object
GetClass return the Class object representing class of the object ToString returns string representation of object equals default object equality (not ptr equality) hashCode Clone makes a duplicate of an object wait, notify, notifyAll used with concurrency finalize
Different conventions for finalize and super
Compiler does not force call to super finalize
Final classes and methods
Restrict inheritance
Final classes and methods cannot be redefined
Outline
Objects in Java
Classes, encapsulation, inheritance
Example
java.lang.String
Type system
Primitive types, interfaces, arrays, exceptions
Reasons for this feature
Important for security
Programmer controls behavior of all subclasses Critical because subclasses produce subtypes
Generics (added in Java 1.5)
Basics, wildcards,
Virtual machine
Loader, verifier, linker, interpreter Bytecodes for method lookup
Compare to C++ virtual/non-virtual
Method is virtual until it becomes final
Security issues
Java Types
Two general kinds of types
Primitive types not objects
Integers, Booleans, etc
Classification of Java types
Reference Types
Object Object[ ] Throwable
Reference types
Classes, interfaces, arrays No syntax distinguishing Object * from Object
Static type checking
Every expression has type, determined from its parts Some auto conversions, many casts are checked at run time Example, assuming A <: B
If A x, then can use x as argument to method that requires B If B x, then can try to cast x to A Downcast checked at run-time, may raise exception
Shape Circle Square
Shape[ ] Circle[ ] Square[ ] arrays
Exception types
user-defined Primitive Types boolean int byte
float
long
Subtyping
Primitive types
Conversions: int -> long, double -> long,
Java class subtyping
Signature Conformance
Subclass method signatures must conform to those of superclass
Class subtyping similar to C++
Subclass produces subtype Single inheritance => subclasses form tree
Three ways signature could vary
Argument types Return type Exceptions How much conformance is needed in principle?
Interfaces
Completely abstract classes
no implementation
Multiple subtyping
Interface can have multiple subtypes (implements, extends)
Java rule
Java 1.1: Arguments and returns must have identical types, may remove exceptions Java 1.5: covariant return type specialization
Arrays
Covariant subtyping not consistent with semantic principles
Interface subtyping: example
interface Shape { public float center(); public void rotate(float degrees); } interface Drawable { public void setColor(Color c); public void draw(); } class Circle implements Shape, Drawable { // does not inherit any implementation // but must define Shape, Drawable methods }
Properties of interfaces
Flexibility
Allows subtype graph instead of tree Avoids problems with multiple inheritance of implementations (remember C++ diamond)
Cost
Offset in method lookup table not known at compile Different bytecodes for method lookup
one when class is known one when only interface is known search for location of method cache for use next time this call is made (from this line) More about this later
Array types
Automatically defined
Array type T[ ] exists for each class, interface type T Cannot extended array types (array types are final) Multi-dimensional arrays are arrays of arrays: T[ ] [ ]
Array subtyping
Covariance
if S <: T then S[ ] <: T[ ]
Standard type error
class A {} class B extends A {} B[ ] bArray = new B[10] A[ ] aArray = bArray // considered OK since B[] <: A[] aArray[0] = new A() // compiles, but run-time error
// raises ArrayStoreException
Treated as reference type
An array variable is a pointer to an array, can be null Example: Circle[] x = new Circle[array_size] Anonymous array expression: new int[] {1,2,3, ... 10}
Every array type is a subtype of Object[ ], Object
Length of array is not part of its static type
Covariance problem again
Remember Simula problem
If A <: B, then A ref <: B ref Needed run-time test to prevent bad assignment Covariance for assignable cells is not right in principle
Afterthought on Java arrays
Date: Fri, 09 Oct 1998 09:41:05 -0600 From: bill joy Subject: [discussion about java genericity] actually, java array covariance was done for less noble reasons : it made some generic "bcopy" (memory copy) and like operations much easier to write... I proposed to take this out in 95, but it was too late (...). i think it is unfortunate that it wasn't taken out... it would have made adding genericity later much cleaner, and [array covariance] doesn't pay for its complexity today. wnj
Explanation
interface of T reference cell is
put : T T ref get : T ref T
Remember covariance/contravariance of functions
Java Exceptions
Similar basic functionality to ML, C++
Constructs to throw and catch exceptions Dynamic scoping of handler
Exception Classes
Throwable Exception checked exceptions Runtime Exception Error
Some differences
An exception is an object from an exception class Subtyping between exception classes
Use subtyping to match type of exception or pass it on Similar functionality to ML pattern matching in handler
User-defined exception classes
Unchecked exceptions
Type of method includes exceptions it can throw
Actually, only subclasses of Exception (see next slide)
If a method may throw a checked exception, then this must be in the type of the method
Try/finally blocks
Exceptions are caught in try blocks
try { statements } catch (ex-type1 identifier1) { statements } catch (ex-type2 identifier2) { statements } finally { statements }
Why define new exception types?
Exception may contain data
Class Throwable includes a string field so that cause of exception can be described Pass other data by declaring additional fields or methods
Subtype hierarchy used to catch exceptions
catch <exception-type> <identifier> { } will catch any exception from any subtype of exception-type and bind object to identifier
Implementation: finally compiled to jsr
Outline
Objects in Java
Classes, encapsulation, inheritance
Java Generic Programming
Java has class Object
Supertype of all object types This allows subtype polymorphism
Can apply operation on class T to any subclass S <: T
Type system
Primitive types, interfaces, arrays, exceptions
Generics (added in Java 1.5)
Basics, wildcards,
Java 1.0 1.4 did not have templates
No parametric polymorphism Many considered this the biggest deficiency of Java
Virtual machine
Loader, verifier, linker, interpreter Bytecodes for method lookup
Java type system does not let you cheat
Can cast from supertype to subtype Cast is checked at run time
Security issues
Example generic construct: Stack
Stacks possible for any type of object
For any type t, can have type stack_of_t Operations push, pop work for any type
Java 1.0
vs Generics
class Stack<A> { void push(A a) { ... } A pop() { ... } ...} String s = "Hello"; Stack<String> st = new Stack<String>(); st.push(s); ... s = st.pop();
In C++, would write generic stack class
template <type t> class Stack { private: t data; Stack<t> * next; public: void push (t* x) { } t* pop ( ){} };
class Stack { void push(Object o) { ... } Object pop() { ... } ...} String s = "Hello"; Stack st = new Stack(); ... st.push(s); ... s = (String) st.pop();
What can we do in Java 1.0?
Why no generics in early Java ?
Many proposals Basic language goals seem clear Details take some effort to work out
Exact typing constraints Implementation
Existing virtual machine? Additional bytecodes? Duplicate code for each instance? Use same code (with casts) for all instances
JSR 14 Java Generics (Java 1.5, Tiger)
Adopts syntax on previous slide Adds auto boxing/unboxing
User conversion Stack<Integer> st = new Stack<Integer>(); st.push(new Integer(12)); ... int i = (st.pop()).intValue(); Automatic conversion Stack<Integer> st = new Stack<Integer>(); st.push(12); ... int i = st.pop();
Java Community proposal (JSR 14) incorporated into Java 1.5
Java generics are type checked
A generic class may use operations on objects of a parameter type
Example: PriorityQueue<T> if x.less(y) then
Example: Hash Table
interface Hashable { int HashCode (); }; class HashTable < Key extends Hashable, Value> { void Insert (Key k, Value v) { int bucket = k.HashCode(); InsertAt (bucket, k, v); } This expression must typecheck }; Use Key extends Hashable
Two possible solutions
C++: Link and see if all operations can be resolved Java: Type check and compile generics w/o linking
This requires programmer to give information about type parameter Example: PriorityQueue<T extends ...>
Priority Queue Example
interface Comparable<I> { boolean lessThan(I); } class PriorityQueue<T extends Comparable<T>> { T queue[ ] ; void insert(T t) { ... if ( t.lessThan(queue[i]) ) ... } T remove() { ... } ... } Why is this form needed? Less: t t t is contravariant in t
Another example
interface LessAndEqual<I> { boolean lessThan(I); boolean equal(I); } class Relations<C extends LessAndEqual<C>> extends C { boolean greaterThan(Relations<C> a) { return a.lessThan(this); } boolean greaterEqual(Relations<C> a) { return greaterThan(a) || equal(a); } boolean notEqual(Relations<C> a) { ... } boolean lessEqual(Relations<C> a) { ... } ... }
Implementing Generics
Type erasure
Compile-time type checking uses generics Compiler eliminates generics by erasing them
Compile List<T> to List, T to Object, insert casts
Outline
Objects in Java
Classes, encapsulation, inheritance
Type system
Primitive types, interfaces, arrays, exceptions
Generics (added in Java 1.5)
Basics, wildcards,
Generics are not templates
Generic declarations are typechecked Generics are compiled once and for all
No instantiation No code bloat
Virtual machine
Loader, verifier, linker, interpreter Bytecodes for method lookup Bytecode verifier (example: initialize before use) Implementation of generics
More later when we talk about virtual machine
Security issues
Java Implementation
Compiler and Virtual Machine
Compiler produces bytecode Virtual machine loads classes on demand, verifies bytecode properties, interprets bytecode
Java Virtual Machine Architecture
A.java Java Compiler Compile source code Java Virtual Machine Loader
rk wo Net
A.class
Why this design?
Bytecode interpreter/compilers used before
Pascal pcode; Smalltalk compilers use bytecode
Minimize machine-dependent part of implementation
Do optimization on bytecode when possible Keep bytecode interpreter simple
Verifier Linker Bytecode Interpreter
B.class
For Java, this gives portability
Transmit bytecode across network
JVM memory areas
Java program has one or more threads Each thread has its own stack All threads share same heap
Class loader
Runtime system loads classes as needed
When class is referenced, loader searches for file of compiled bytecode instructions
Default loading mechanism can be replaced
Define alternate ClassLoader object
Extend the abstract ClassLoader class and implementation ClassLoader does not implement abstract method loadClass, but has methods that can be used to implement loadClass
method area
heap
Java stacks
PC registers
native method stacks
Can obtain bytecodes from alternate source
VM restricts applet communication to site that supplied applet
Example issue in class loading and linking:
Static members and initialization
class ... { /* static variable with initial value */ static int x = initial_value /* ---- static initialization block --- */ static { /* code executed once, when loaded */ } }
JVM Linker and Verifier
Linker
Adds compiled class or interface to runtime system Creates static fields and initializes them Resolves names
Checks symbolic names and replaces with direct references
Verifier
Check bytecode of a class or interface before loaded Throw VerifyError exception if error occurs
Initialization is important
Cannot initialize class fields until loaded
Static block cannot raise an exception
Handler may not be installed at class loading time
Verifier
Bytecode may not come from standard compiler
Evil hacker may write dangerous bytecode
Bytecode interpreter
Standard virtual machine interprets instructions
Perform run-time checks such as array bounds Possible to compile bytecode class file to native code
Verifier checks correctness of bytecode
Every instruction must have a valid operation code Every branch instruction must branch to the start of some other instruction, not middle of instruction Every method must have a structurally correct signature Every instruction obeys the Java type discipline Last condition is fairly complicated .
Java programs can call native methods
Typically functions written in C
Multiple bytecodes for method lookup
invokevirtual - when class of object known invokeinterface - when interface of object known invokestatic - static methods invokespecial - some special cases
Type Safety of JVM
Run-time type checking
All casts are checked to make sure type safe All array references are checked to make sure the array index is within the array bounds References are tested to make sure they are not null before they are dereferenced.
JVM uses stack machine
Java
Class A extends Object { int i void f(int val) { i = val + 1;} }
JVM Activation Record local variables operand stack data area
Return addr, exception info, Const pool res.
Bytecode
Method void f(int) aload 0 ; object ref this iload 1 ; int val iconst 1 iadd ; add val +1 putfield #4 <Field int i> return refers to const pool
Additional features
Automatic garbage collection No pointer arithmetic If program accesses memory, that memory is allocated to the program and declared with correct type
Field and method access
Instruction includes index into constant pool
Constant pool stores symbolic names Store once, instead of each instruction, to save space
invokeinterface <method-spec>
Sample code
void add2(Incrementable x) { x.inc(); x.inc(); }
First execution
Use symbolic name to find field or method
Search for method
find class of the object operand (operand on stack)
must implement the interface named in <method-spec>
Second execution
Use modified quick instruction to simplify search
search the method table for this class find method with the given name and signature
Call the method
Usual function call with new activation record, etc.
10
Why is search necessary?
interface Incrementable { public void inc(); } class IntCounter implements Incrementable { public void add(int); public void inc(); public int value(); } class FloatCounter implements Incrementable { public void inc(); public void add(float); public float value(); }
invokevirtual <method-spec>
Similar to invokeinterface, but class is known Search for method
search the method table of this class find method with the given name and signature
Can we use static type for efficiency?
Each execution of an instruction will be to object from subclass of statically-known class Constant offset into vtable
like C++, but dynamic linking makes search useful first time
See next slide
Bytecode rewriting: invokevirtual
Bytecode invokevirtual Constant pool A.foo()
Bytecode rewriting: invokeinterface
Bytecode invokeinterface Constant pool A.foo()
inv_virt_quick vtable offset
inv_int_quick
A.foo()
After search, rewrite bytcode to use fixed offset into the vtable. No search on second execution.
Cache address of method; check class on second use
Bytecode Verifier
Lets look at one example to see how this works Correctness condition
No operations should be invoked on an object until it has been initialized
Object creation
Example:
Point p = new Point(3) 1: new Point 2: dup 3: iconst 3 4: init Point Java source
Bytecode instructions
new class allocate memory for object init class initialize object on top of stack use class use object on top of stack
bytecode
No easy pattern to match Multiple refs to same uninitialized object
Need some form of alias analysis
11
Alias Analysis
Other situations:
1: new P 2: new P 3: init P
Tracking initialize-before-use
Alias analysis uses line numbers or
new P init P Two pointers to unitialized object created at line 47 are assumed to point to same object All accessible objects must be initialized before jump backwards (possible loop)
Oversight in treatment of local subroutines
Used in implementation of try-finally Object created in finally not necessarily initialized
Equivalence classes based on line where object was created.
No clear security consequence
Bug fixed Have proved correctness of modified verifier for init
Bug in Suns JDK 1.1.4
Example:
1: 2: 3: 4: 5: 6: 7: 8: 9: jsr 10 store 1 jsr 10 store 2 load 2 init P load 1 use P halt 10: store 0 11: new P 12: ret 0 variables 1 and 2 contain references to two different objects which are both uninitialized object created on line 11
Implementing Generics
Two possible implementations
Heterogeneous: instantiate generics Homogeneous: translate generic class to standard class
Example for next few slides: generic list class
template <type t> class List { private: t* data; List<t> * next; public: void Cons (t* x) { } t* Head ( ){} List<t> Tail ( ){} };
Homogeneous Implementation
Heterogeneous Implementation
data next data next
next
next
next next
Same representation and code for all types of data
Specialize representation, code according to type
12
Issues
Data on heap, manipulated by pointer
Every list cell has two pointers, data and next All pointers are same size Can use same representation, code for all types
Heterogeneous Implementation
Compile generic class C<param>
Check use of parameter type according to constraints Produce extended form of bytecode class file
Store constraints, type parameter names in bytecode file
Data stored in local variables
List cell must have space for data Different representation for different types Different code if offset of fields built into code
Expand when class C<actual> is loaded
Replace parameter type by actual class Result is ordinary class file This is a preprocessor to the class loader:
No change to the virtual machine No need for additional bytecodes
Example: Hash Table
interface Hashable { int HashCode (); }; class HashTable < Key implements Hashable, Value> { void Insert (Key k, Value v) { int bucket = k.HashCode(); InsertAt (bucket, k, v); } };
Generic bytecode with placeholders
void Insert (Key k, Value v) { int bucket = k.HashCode(); InsertAt (bucket, k, v); } Method void Insert($1, $2) aload_1 invokevirtual #6 <Method $1.HashCode()I> istore_3 aload_0 iload_3 aload_1 aload_2 invokevirtual #7 <Method HashTable<$1,$2>. InsertAt(IL$1;L$2;)V> return
Instantiation of generic bytecode
void Insert (Key k, Value v) { int bucket = k.HashCode(); InsertAt (bucket, k, v); } Method void Insert(Name, Integer) aload_1 invokevirtual #6 <Method Name.HashCode()I> istore_3 aload_0 iload_3 aload_1 aload_2 invokevirtual #7 <Method HashTable<Name,Integer> InsertAt(ILName;LInteger;)V> return
Load parameterized class file
Use of HashTable <Name, Integer> invokes loader Several preprocess steps
Locate bytecode for parameterized class, actual types Check the parameter constraints against actual class Substitute actual type name for parameter type Proceed with verifier, linker as usual.
Can be implemented with ~500 lines Java code
Portable, efficient, no need to change virtual machine
13
Java 1.5 Implementation
Homogeneous implementation
class Stack<A> { void push(A a) { ... } A pop() { ... } ...} class Stack { void push(Object o) { ... } Object pop() { ... } ...}
Some details that matter
Allocation of static variables
Heterogeneous: separate copy for each instance Homogenous: one copy shared by all instances
Constructor of actual class parameter
Heterogeneous: class G<T> T x = new T; Homogenous: new T may just be Object !
Algorithm
replace class parameter <A> by Object, insert casts if <A extends B>, replace A by B
Resolve overloading
Heterogeneous: could try to resolve at instantiation time (C++) Homogenous: no information about type parameter
When is template instantiated?
Compile- or link-time (C++) Java alternative: class load time
Why choose this implementation?
Backward compatibility of distributed bytecode Surprise: sometimes faster because class loading slow
Outline
Objects in Java
Classes, encapsulation, inheritance
Java Security
Security
Prevent unauthorized use of computational resources
Type system
Primitive types, interfaces, arrays, exceptions
Java security
Java code can read input from careless user or malicious attacker Java code can be transmitted over network code may be written by careless friend or malicious attacker
Generics (added in Java 1.5)
Basics, wildcards,
Virtual machine
Loader, verifier, linker, interpreter Bytecodes for method lookup Bytecode verifier (example: initialize before use) Implementation of generics
Security issues
Java is designed to reduce many security risks
Java Security Mechanisms
Sandboxing
Run program in restricted environment
Analogy: childs sandbox with only safe toys
Buffer Overflow Attack
Most prevalent security problem today
Approximately 80% of CERT advisories are related to buffer overflow vulnerabilities in OS, other code
This term refers to
Features of loader, verifier, interpreter that restrict program Java Security Manager, a special object that acts as access control gatekeeper
General network-based attack
Attacker sends carefully designed network msgs Input causes privileged program (e.g., Sendmail) to do something it was not designed to do
Code signing
Use cryptography to establish origin of class file
This info can be used by security manager
Does not work in Java
Illustrates what Java was designed to prevent
14
Sample C code to illustrate attack
void f (char *str) { char buffer[16]; strcpy(buffer,str); } void main() { char large_string[256]; int i; for( i = 0; i < 255; i++) large_string[i] = 'A'; f(large_string); }
Java Sandbox
Four complementary mechanisms
Class loader
Separate namespaces for separate class loaders Associates protection domain with each class
Function
Copies str into buffer until null character found Could write past end of buffer,
over function retun addr
Calling program
Writes 'A' over f activation record Function f returns to location 0x4141414141 This causes segmentation fault
Verifier and JVM run-time tests
NO unchecked casts or other type errors, NO array overflow Preserves private, protected visibility levels
Security Manager
Called by library functions to decide if request is allowed Uses protection domain associated with code, user policy Recall: stack inspection problem on midterm
Variations
Put meaningful address in string Put code in string and jump to it !!
See: Smashing the stack for fun and profit
Why is typing a security feature?
Sandbox mechanisms all rely on type safety Example
Unchecked C cast lets code make any system call
int (*fp)() ... fp = addr; (*fp)(n); /* variable "fp" is a function pointer */
Security Manager
Java library functions call security manager Security manager object answers at run time
Decide if calling code is allowed to do operation Examine protection domain of calling class
Signer: organization that signed code before loading Location: URL where the Java classes came from
/* assign address stored in an integer var */ /* call the function at this address */
Uses the system policy to decide access permission
Other examples involving type confusion in book
Sample SecurityManager methods
checkExec checkRead checkWrite checkListen checkConnect checkCreate ClassLoader Checks if the system commands can be executed. Checks if a file can be read from. Checks if a file can be written to. Checks if a certain network port can be listened to for connections. Checks if a network connection can be created. Check to prevent the installation of additional ClassLoaders.
Stack Inspection
Permission depends on
Permission of calling method Permission of all methods above it on stack
Up to method that is trusted and asserts this trust
method f method g method h java.io.FileInputStream
Many details omitted here
Stories: Netscape font / passwd bug; Shockwave plug-in
15
Java Summary
Objects
have fields and methods alloc on heap, access by pointer, garbage collected
Java Summary (II)
Subtyping
Determined from inheritance hierarchy Class may implement multiple interfaces
Classes
Public, Private, Protected, Package (not exactly C++) Can have static (class) members Constructors and finalize methods
Virtual machine
Load bytecode for classes at run time Verifier checks bytecode Interpreter also makes run-time checks
type casts array bounds
Inheritance
Single inheritance Final classes and methods
Portability and security are main considerations
Some Highlights
Dynamic lookup
Different bytecodes for by-class, by-interface Search vtable + Bytecode rewriting or caching
Comparison with C++
Almost everything is object + Simplicity - Efficiency
except for values from primitive types
Subtyping
Interfaces instead of multiple inheritance Awkward treatment of array subtyping (my opinion)
Type safe
Generics
Type checked, not instantiated, some limitations (<T>new T)
+ Safety +/- Code complexity - Efficiency Arrays are bounds checked No pointer arithmetic, no unchecked type casts Garbage collected
Bytecode-based JVM
Bytcode verifier Security: security manager, stack inspection
Interpreted
+ Portability + Safety - Efficiency Compiled to byte code: a generalized form of assembly language designed to interpret quickly. Byte codes contain type information
Comparison
Objects accessed by ptr
(contd)
Links for material not in book
Enhancements in JDK 5
http://java.sun.com/j2se/1.5.0/docs/guide/language/i ndex.html
+ Simplicity - Efficiency No problems with direct manipulation of objects
Garbage collection: + Safety + Simplicity - Efficiency
Needed to support type safety
J2SE 5.0 in a Nutshell
http://java.sun.com/developer/technicalArticles/relea ses/j2se15/
Built-in concurrency support + Portability
Used for concurrent garbage collection (avoid waiting?) Concurrency control via synchronous methods Part of network support: download data while executing
Generics
http://www.langer.camelot.de/Resources/Links/Java Generics.htm
Exceptions
As in C++, integral part of language design
16
17