6.005 Elements of Software Construction | Fall 2012
Project 2: Realtime collaborative editing

Due: November 27, 2012 @ Midnight;
December 4, 2012 @ Midnight; and
December 12, 2012 @ Midnight

Problem
Purpose
Specification
Deliverables and Grading
Hints
Resources

Problem

A collaborative editor allows multiple users to work on a single document simultaneously, across a network. Examples include Writely and Etherpad (both acquired by Google), SynchroEdit, ZohoWriter and many others. In this project, you will build your own collaborative editor. This is a challenging application, since it involves handling concurrency (so that simultaneous actions interleave in a reasonable way), dealing with network connections and building a graphical user interface. It should also be fun because you'll be building a tool you can use yourself. You'll become familiar with a variety of technologies, and you'll have an opportunity to practice applying the fundamental ideas you've learned in the course and working on a team.

Purpose

The purpose of this project is threefold. First, you will have an opportunity to practice using the fundamental design and implementation techniques you've learned in the course -- most notably, mutable datatypes, but also ideas from the earlier paradigms (state machines, functions over immutable types, data abstraction, etc.).

Second, the project will give you practice using a variety of technologies -- networking, sockets and I/O, and Swing (Java's GUI toolkit) -- and will help you deepen your understanding of common design patterns, such as view hierarchy and model-view-controller for user interfaces, publish/subscribe patterns (such as Listeners), and queue-based treatments of concurrency.

Third, you'll have another chance to work on a team, this time for a slightly more ambitious project.

Specification

You have considerable freedom to design your editor as you choose, not only in the implementation, but also in designing the behavior. We recommend, however, that you design and implement for the minimal required functionality first, and add extensions only when you are confident that the core is stable and solid.

Some of the decisions you will have to make are:

what set of editing actions are provided;
how the document is structured;
how documents are named and accessed by users;
where documents are stored (e.g. at a central server or on clients);
what guarantees are made about the effects of concurrent edits

Nevertheless, your tool must meet some minimal requirements:

Real-time responsiveness. Users should be able to work on a document together in real time, without appreciable delays between making and observing edits.
Multiple users. It should be possible for at least three users to be able to work on the same document at once, and the system should allow multiple documents to be active at once.
Textual format. Your editor should allow, at a minimum, editing of ASCII text and should handle line-breaks appropriately.
Persistence. Users should be able to edit a document over several network sessions, with the server maintaining the document even when no users are currently connected. Note that persisting to disk is not necessary, so it's okay if shutting down the server erases all the documents. But the state of documents should persist across client disconnects and reconnects.

Tasks

Group repository: Pull your team repo from:/afs/athena.mit.edu/course/6/6.005/git/fa12/projects/rtce/member1-member2-member3.git

Create a New Project in Package Explorer.
Uncheck the Use Default Location checkbox.
Set the new location to the location of your team git repository.

Abstract design: Using state machines and datatypes, explore the behavioral design of your tool. In particular, define carefully and precisely what an edit is. Make sure to record not only the decisions you made, but the decisions you rejected, along with their rationale.
Protocol: Design a protocol for the system's components. Your protocol should be a state machine representing the state of communication, and a set of messages passed between the client and server. Express your protocol as a state machine, and give the structure of the protocol messages using a grammar.
User interface design: Sketch your user interface and its various screens and dialogs. Use these sketches to explore alternatives and to plan the structure and flow of your interface. Sketching on paper is recommended and make sure to scan and upload your diagrams as a PDF. For a list of scanners see resources.
Code design: This is not a design document. You will upload the structure of your code as Java files. This will include all major classes and methods. Remember to write detailed method and class specs and record important design decisions on top of your files -- in particular all the patterns you chose (see the lecture notes for reminders of all the patterns covered in class) -- and their rationale. Pay particular attention to concurrency design, and to making it easy to run automated tests.
Testing strategy: Devise a strategy for testing your system (determining what order you will test components in, how you will derive test cases, what level of coverage you plan to achieve, how you will test the user interface, etc.). Justify your strategy with an argument that the level of effort is appropriate for the likely prevalence and location of bugs, and the degree of reliability required. Document your testing strategy for all components of the system in your code.
Implementation: Write code based on your design. Make sure to update the design documents if the design changes. Make sure to make extensive use of runtime assertions, data abstraction, and representation invariants.
Testing: Execute your testing strategy and provide end-to-end testing of the system.
Reflections: Write a brief commentary saying what you learned from this experience. What was easy? What was hard? What was unexpected? Briefly evaluate your solution, pointing out its key merits and deficiencies. This is an individual activity.

Deliverables and Grading

There are three deadlines for this project listed at the top of this document, each with its own deliverables.

For the first deadline (11:59 PM, November 27, 2012):

Team contracts, which should be in a file called TeamContract.pdf.
the abstract designs and protocol uploaded as one file called Design.pdf.

For the second deadline (11:59 PM, December 4, 2012):

the code designs and testing strategy uploaded as Java files.
the user interface design uploaded as Sketches.pdf.
and a demo of some working portion of the system.

The purpose of the demo is to demonstrate progress, and to help you focus on understanding a critical or high-risk area of the design. It might include, for example, a basic server that sends and receives messages but without a GUI client (see the the hints about telnet). Or you might have a working basic GUI with no server backend but a simple API for connecting to one. Talk to your TA beforehand if you are unsure about what is sufficient.

For the third and final deadline (11:59 PM, December 12, 2012):

the implementation: push your final implementation to the remote repositories.
Your reflections on the project.

Reflections on Team. How did you feel the group did? How did your team work? How was the coding? How did you split the work?
Reflections on Individual. How do you think you did? What did you do in the project? How do you feel about it?

Note: Reflections are uploaded by each member individually on Stellar as a Reflections.pdf file.

The grading breakdown is as follows:

30% for the abstract design, architecture, protocol, and usability design
25% for the code design, initial demo, and testing strategy
40% for implementation and testing
5% for reflection

Each deliverable should be submitted in the deliverables folder of your project repository. After each of the first two deliverables, you will meet with your TA the next day during the lecture period to review them and plan your next steps. Be prepared to do your demo for your TA at your second meeting. You should bring a printout of your deliverable to these meetings. For your user interface sketches, scan the files and place them in your PDF document (see resources for a list of scanners on campus), or use a drawing tool.

Hints

Defining documents and edits. Perhaps the hardest and most interesting part of this project is the abstract design, in which you will determine what constitutes a document, and what an edit is. Is a document more like a chat room or more like an email message? How are documents named, and do the names have a context? Does an edit occur at a point in the document, or are global changes allowed? If it occurs at a point, what happens when two users edit simultaneously? Will both their edits be reflected or might one override the other? Are some edits independent and some interfering, and if so how is interference resolved? The most important advice we can give you here is to keep it simple. You can always extend your program, but if you've created a complex mess, it's hard to go back.

Designing architecture and protocol. You must also devise a network architecture and a protocol for this project. A client/server architecture is potentially the easiest choice here, but it isn't required.

You should strongly consider using a text-based protocol, which is generally easier for testing and debugging. We used text-based protocols in the problem sets. Services that use plain text protocols — e.g. HTTP or SMTP — can talk to a human just as well as another machine by using a client program that sends and receives characters. You've already used telnet as a tool for interacting with text-based protocols in the problem set. Make use of it here too.

Handling multiple users. Since realtime collaborative editing is not realtime without at least two people, your system must be able to handle multiple users connected at the same time. One reasonable design approach could be using one thread for reading input from each client and having a central state machine representing the state of the server (using one more thread, to which each of the client threads pass messages through a shared queue).

Design for safe concurrency. In general, making an argument that an implementation is free of concurrency bugs (like race conditions and deadlocks) is very difficult and error-prone. The best strategy therefore is to design your program to allow a very simple argument, by limiting your use of concurrency and especially avoiding shared state wherever possible. For example, one approach is to use concurrency only for reading sockets, and to make the rest of the design single-threaded.

And recall that even though user interfaces are concurrent by nature, Swing is not thread-safe. Recommended reading: Threads and Swing.

Design for testability. To make it possible to write unit tests without having to open socket connections and parse streams of responses, you should design your components so that they can be driven directly by a test driver -- e.g. by calling methods or by putting messages into a queue.

Testing GUIs is particularly challenging. Follow good design practice and separate as much functionality as possible into modules you can test using automated mechanisms. You should maximize the amount of your system you can test with complete independence from any GUI.

Resources

List of available scanners to scan your User Interface Design sketches.

New Media Center at 26-139, an Athena cluster with dozens of Macs and 1 flatbed scanner. Log in to the attached Mac and use Applications -> ImageCapture to scan.
SIPB at W20-557 (outside the W20 Athena cluster) has two flatbed scanners available for general use.
Stata Center scanner on second floor, available to CSAIL members.
Simmons Hall Athena Cluster has one flatbed scanner. Requires your own computer and of course, permission to enter the dorm.

6.005 Elements of Software Construction | Fall 2012 Project 2: Realtime collaborative editing

Due: November 27, 2012 @ Midnight; December 4, 2012 @ Midnight; and December 12, 2012 @ Midnight