Merri Beth Lavagnino - Privacy and Policy
Policy and privacy are really consideration of the human aspects and impacts of technology. Policies are: strategic direction and operating philosophy (which are usually informal and cultural), Public and Institutional policies (these are both documented and usually legal documents).
Institutional policy - a statement that reflect the philosophies and values of the project, service, organization or federation. Policies should be clear and concise, applicable across a wide range of activities and should not change very much.
Why create a policy?
Where does the policy apply? Federation, Institution, Service
Real-life stories:
“A policy is a temporary creed liable to be changed, but while it holds good it has got to be pursued with apostolic zeal.” Mohandas K. Gandhi
Privacy:
Categories of privacy harms:
Fair Information Practice Principles: The FTC drafted these principles and they do enforce them. Higher Ed is not under the FTC’s jurisdiction but users are expecting these principles to be met. If we don’t
Ken Klingenstein: Federated Identity and Data Protection Law
Good quote from Ken K: “This is an attempt to bring trust to internet via technology not just because it is just us chickens”.
EU Law Directive 95/46/EC : You can process personal data when it is required to perform contact, required to satisfy legal duty or consent.
Identity Providers must identify which services are necessary for education and research. Must inform the users. May seek users’ informed freed consent to release personal data to other services. You have to show why it is important. Should have a data process/data controller agreement with all service providers to whom personally identifiable data is released. Must ensure adequate protection of any data released to services outside the EU. We have to play by the EU rules.
Service Providers must consider whether personally identifiable information is necessary for their service or whether anonymous identifiers are sufficient. You may request personal information from users but you must inform.
There is no normalized definition of what Personal Identifiable Information (PII). There are questions about email addresses: if it is a third party email address it might not be but a .edu address might be. So the content might be more important than the field.
IP Addresses - if it is a dynamic address it is not PII. So, unless you know it is a dynamic address, then you have to treat it as PII.
EduPerson Targeted ID - this is going to the EU privacy commission this Fall. It is a 32 bit opaque identifier that is different per site visited.
OASIS Cross-Enterprise Security and Privacy Authorization (XSPA) - just formed group. A mechanism to allow consent agreements flow with data. The first and dominant Use Case is health care. Looking for other Use Cases. Does this make consent a new service in our loosely coupled service? Do services need to be consent aware?
Report Out from Discussion Sessions:
Data Modeling Group:
Modeling person and organization data. Modeling of organization data is remarkably difficult not just in the nature of the data but also in the resistance that you get from organizations to being characterized. Multiple organization charts - financial, hr and reporting structure. The characterizations can be political. Are there pressures that will lead to the marginalization old way of doing things? Organizations that don’t want to be characterized may not get services.
Service Discovery:
What would a service description look like: what is it called, cost, how to call it, operational context (where is it physically located). Discussion about how you describe the service, how do you recognize similar services in distributed locations. Talked about the grid is doing this with their RNA.
What is happening today: people using Google to search for services and looking for a WSDL.
How do you get consent? What about promises and claims? What about a directory of all the services? What about a directory of directory? You could have a convention for naming the directory so you could at least find the directories.
DNS works for finding things.
Governance:
Domain Governance - governance revolves around an application or a data element, or attribute (student ID). These models will have to evolve to domain governance: enrollment, IdM etc.
Who owns the data especially as the data is transformed and sent along the ESB? Services are requesting the data that can then be used by other services.
SLAs - keeping tracking of who can use the use the service.
The need for a directory of services especially in emergency notification. There is also a need to know who is consuming services so you can notify on changes.
What is being done now on campuses? It is evolving on campuses. Identity and Access Management is a domain that is being governed as a domain at Penn State.
Saint Louis University has a good examples of domains in higher education that need to be governed as a domain.
Lightening Talks:
Rob Carter: Tracking and Authenticating IP in Cyberspace
We had all of our resources stored inside the walls of the institution. We now see with cloud computing and Web 2.0 applications, our intellectual property out in the cloud. How do we track the reuse of them? How do we contextualize the content.
How do we know that it is really and artifact of mine and not someone spoofing my creations?
Could solve this with digital signatures. What if we could add metadata before it goes out into the cloud. Get a signature of the object and attach the signature to the object or store it elsewhere.
How does this align with Creative Commons licensing efforts. You can search and crawl for for CC licensed objects that you use.
Loretta Auvil: Music Analysis.
Dynamic analysis of a Tom Lehrer file. Very entertaining.
Scotty Logan: IAM Services and Well Behaved Apps
If every app does its own thing, there is no real management.
Trust the container: Identity - you can get a user name from Tomcat et al, Authentication, Authorization
Have the container provider the groups and privileges as a URI
OAuth.net - a specification developed by a group to solve the “I want my Flickr protected photos on Facebook but I don’t want to give you my Flickr username and password”.
Technorati Tags: CAMP, EDUCAUSE, IT Architecture, Policy

Dave Gimpl: Computing as a Service
Infrastructure for vaporware. They are working on the infrastructure that enables cloud-computing.
Challenges in the data center: rising costs of the operations, the explosion of data, the difficulty of deploying new application and services, the difficulty in managing complex virtual machine systems. When you map the business processes, they map to a variety of systems on the data center floor.
Blue Cloud is IBM’s entry in Cloud Computing. Cloud Computing is holistic systems management. Similar to Grid or Cluster computing. A combination of “pervasive virtualization” for both server and storage. Allows for virtualization across varied hardware (I think). On demand and autonomic management and Utility Computing (Amazon’s service offering).
They gather up like systems (not necessarily identical) and manage them as a pool. The focus changes from managing the SAN or server. You let the “ensemble” manage itself and you manage the Virtual Image.
When the image moves to another system, does it move with state?
North Carolina State’s implementation is open source. All of the standards are open source. The ensembles are wrapped with SOAP/SOA interfaces. At North Carolina State Virtual Compute Lab - a student can request a XP machine to do their project. They get the machine in increments of 30 minutes. They are providing service for other institutions in their area.
Ken Klingenstein mentions a paper “The Computational Data Center: The Science Cloud”
Mark Morgan: Genesis II - Accessible, Standards Based Grid Computing
http://www.cs.virginia.edu/~vcgr
The problems: we have target grid user that are unable or unwilling to learn new programming tools & paradigms. Users want the benefit of the grid without having to know about the grid.
Anything you can put a service in front of and put on the internet, is part of the grid. Telescopes, microscopes, computing power, storage, data, sensors.
Want to share this but sharing in a mutually distrustful domain.
Genesis II implements the standards that come out of the OGF (Open Grid Foundation) to test them and vet them. Open Grid Service Architecture is part of the OGF.
Grids have been around for a long time but they are being used. People who design grids want cool features. User don’t care. Genesis II is focused on the user and making grids usable.
The Specs:
You interact with the grid system in “file-like” ways. Double click on a database query, drag a job onto a server resource, etc.
They use an FTP interface to manage resources on the grid. On linux side, OGRSH acts as an intermediary between bash and the grid. Users can do “ls”, “cat”, “cp” and OGRSH will redirect requests into the grid as appropriate.
Nigel Watling: Cloud Computing and the Internet Service Bus
http://biztalk.net
Building out a new data center in Chicago. Microsoft is deploying 10,000 servers a month to support cloud computing. Amazon expects their services operation to bypass the retail business soon.
Issues that come up:
Connect their composite application through an ESB to the internal applications and then out to the cloud for distributed resources.
Roland Hedberg: OM2
http://www.openmetadir.org
OM2 is about representing events and moving information about events from one place to another. A publish-subscribe messaging system originally designed around IdM. Implementations in Python, Java and PERL.
Three ontologies: message, operation and object ontologies. Message is the header like for mail. Operation describes the actions (Miro ontology) which includes if-then-else as well as the usual add, modify, etc. Objects describe the objects.
Messages are based on RDF/XML. Includes support for Dynamic delegation Discovery System (DDDS, RFC 3401-3).
“Ontology Driven Application Development.”
Example applications:
Eduroam (http://www.eduroam.org) : allows you to travel between universities throughout Europe and use your local credentials to authenticate to the wireless network.
Bologna Process: supporting the movement of students between universities. Any student should be able to go another university and take a class then come back. Has admissions control and grade reporting.
What OM2 does: Transport the information to the correct address at all time by the use of DDDS, by the transport protocol of the receivers choice.
Brian Busby: ESB at UW-Madison
Talk about our use of the ESB and experience with SOA.
UW-System has been looking at SOA for years (4 or 5 years). We got to where we were going to buy a commercial SOA suite but we passed on the purchase. SOA went into hibernation. Then two projects came along:
We made a decision to take advantage of a license for the Cape Clear ESB. We can take advantage of this.
Interesting impact: people suddenly had to change their discussion to be around services that they need not big data loads or APIs and they made the change.
Issues:
Organization Issues:
The fact that we got the ESB in place is driving the conversations that we were having years ago forward finally.
Technorati Tags: EDUCAUSE, IT Architecture, Jim Phelps, Service Oriented Architecture, SOA
