September 27, 2014

Building OS X Apps with JavaScript

OS X Yosemite introduced JavaScript for Automation. This makes it possible to access native OS X frameworks with JavaScript. I’ve been digging in to this new world and putting together examples along the way. In this post I’ll explain the basics and step through building a small example app.

WWDC 2014 included a session on JavaScript for Automation. The session explained that you would be able to use JavaScript in place of AppleScript to automate applications. That alone is exciting news. Being able to automate repetitive tasks using AppleScript has been around a long time. AppleScript is not the most fun thing to write so using a familiar syntax in its place is welcome.

During the session the presenter explains the Objective-C bridge. This is where things get super cool. The bridge allows you to import any Objective-C framework into JS applications. For example, if you want to build a GUI using standard OS X controls you would import Cocoa:

ObjC.import("Cocoa");

The Foundation framework does what its name suggests. It provides the building blocks for OS X apps. The framework includes a large amount of classes and protocols. NSArray, NSURL, NSUserNotification, and so on. You may not be familiar with those classes, but their names hint at what they do. Because of its importance, you can use its classes without needing to import it in new apps. They are available by default.

As far as I can tell, you can use JavaScript to build anything that you can with Objective-C or Swift.

Building an example app

Note: You need Yosemite Developer Preview 7+ for the examples to work.

The best way to get going is to jump in and build something. I’m going to walk through building a small app that lets you display an image from your computer.

You can download the completed example from my example apps repo.

Making up the app is; a window, text label, text input, and button. Or by their class names; NSWindow, NSTextField, NSTextField, and NSButton.

Clicking the Choose an Image button will show an NSOpenPanel that allows you to select a file. We’ll configure the panel to restrict the file selection to .jpg, .png, or .gif.

After selecting an image, we'll display it in the window. The window will resize to match the width and height of the image plus the height of the controls. We’ll also set a minimum width and height for the window to make sure the controls don’t get cut off.

Setting up the project

Open the Apple Script Editor application located in Applications > Utilities. Script Editor ain’t the best editor I’ve ever used, but it’s necessary for now. It has a bunch of features that we need for building JS OS X apps. I’m not sure what goes on beyond the scenes, but it can compile and run your scripts as apps. It also creates extra stuff that we need like an Info.plist file. My guess is there are ways to make other editors do the same, but I haven’t looked into it yet.

Create a new document with File > New or cmd + n. The first thing we need to do is save the document as an application. Save with File > Save or cmd + s. Don’t confirm the save right away. There are two options that are necessary to make this run as an application.

The Script Editor Save dialog with important fields highlighted

Change the File Format to “Application”. Check the “Stay open after run handler” option. It’s important to set both of these options the first time you save the file because–as far as I can tell–there is no way to set them later. If you miss one of these you’ll have to create a new document and copy your code over.

~~If anyone knows a way to change the settings after creation, let me know.~~

Update: A nice person explained how to update the settings. Open the File menu and then hold down the option key. That will give you the “Save as...” option. The save dialog allows you to make changes to the settings. It’s still best to set these on first save to avoid the hassle.

If you don’t select “Stay open after run handler” your app will open, flash for a split second, and then close. There is not much in the way of documentation for this stuff online. I only learned this detail after hours of forehead/keyboard merging.

Let’s make something do something!

Add the following two lines to your script and then run it with Script > Run Application or opt + cmd + r.

ObjC.import("Cocoa");
$.NSLog("Hi everybody!");

Not much happened. The only visible changes are in the menu bar and dock. The app name along with File and Edit menus are in the menu bar. You can see the app is running because its icon is in your dock.

Where is the “Hi everybody!”? What is with the dollar sign, what is this, jQuery? Quit the app with File > Quit or cmd + q and we’ll find out where that NSLog went.

Open the Console app: Applications > Utilities > Console. Every application can log messages to the console. This console is not much different than Developer Tools in Chrome, Safari, or Firefox. The main difference is you use it for debugging applications instead of websites.

There’s a lot of messages in the console. You can filter it down by typing “applet” into the search field in the top right corner. Leave “applet” in the search field and go back to Script Editor. Run the application again with opt + cmd + r.

A screenshot showing a log message in Console.app

Did you see it!? The message “Hi everybody!” should have logged to the console. If you didn’t see it, quit your application and run it again. A lot of times I forget to quit and the code doesn’t run again.

What about the dollar sign?

The dollar sign is your access to the Objective-C bridge. Any time you need to access an Objective-c class or constant, you use $.foo or ObjC.foo. There are a couple of other ways to use $ that I’ll cover later.

Console App and NSLog are indispensable tools, you’ll use them non-stop for debugging. For examples of logging things other than strings, have a look at my NSLog example.

Creating the window

Let’s make something we can see and interact with. Update your script to look like the following code:

ObjC.import("Cocoa");

var styleMask = $.NSTitledWindowMask | $.NSClosableWindowMask | $.NSMiniaturizableWindowMask;
var windowHeight = 85;
var windowWidth = 600;
var ctrlsHeight = 80;
var minWidth = 400;
var minHeight = 340;
var window = $.NSWindow.alloc.initWithContentRectStyleMaskBackingDefer(
  $.NSMakeRect(0, 0, windowWidth, windowHeight),
  styleMask,
  $.NSBackingStoreBuffered,
  false
);

window.center;
window.title = "Choose and Display Image";
window.makeKeyAndOrderFront(window);

When that’s in place, run the app. opt + cmd + r. Now we’re talking! With a small amount of code we’ve built an app that launches a window with a title that we can move, minimize, and close.

If you’ve never built an app with Objective-C or Cocoa–like me–parts of this might look like a lot of gibberish. For me, that was due to the length of the method names. I enjoy descriptive names, but Cocoa takes it to the extreme.

Looking past that though, this is JavaScript. This code looks just like code you’d write when building a website.

What’s going on with that first new line where we set styleMask? You use style masks to configure windows. Each style option says what it adds; a title, a close button, a minimize button. These options are constants. Use a pipe “|” to separate options. That pipe is a C bitwise OR operator. I’m not gonna pretend to know what that means. I just know it’s needed to combine style options for a mask and that’s good enough for me.

There are more style options. You can read about them in the docs. NSResizableWindowMask is one you’ll use. Try adding it to the style mask to see what it does.

There are a couple interesting things about syntax that you’ll need to remember. $.NSWindow.alloc calls the alloc method of NSWindow. Notice there are no parenthesis “()” after alloc. In JavaScript that’s how we access properties, not how we call a methods. What’s with that? In JS for OS X parenthesis are only allowed when calling a method if you pass arguments to it. If you use parenthesis with no arguments, you will get a runtime error. Check Console for errors when things don’t do what you think they should.

The next thing is about that super long method name:

initWithContentRectStyleMaskBackingDefer

Take a look at the NSWindow docs for that method and you’ll notice it looks a bit different.

initWithContentRect:styleMask:backing:defer:

In Objective-C you would create the same window we did like this:

NSWindow* window [[NSWindow alloc]
  initWithContentRect: NSMakeRect(0, 0, windowWidth, windowHeight)
  styleMask: styleMask,
  backing: NSBackingStoreBuffered
  defer: NO];

The thing to note are the colons “:” in the original method signature. When you convert and Objective-C method to JS you remove the colons and capitalize the first letter following it. When you see two items in brackets “[]”, that is calling a method of a class or object. [NSWindow alloc] calls the alloc method of NSWindow. For JS, convert those to dot-notation and parenthesis if necessary; NSWindow.alloc.

I think the rest of the code to create and display the window is straightforward-enough. I’ll skip any more detailed description of it. It takes time and a lot of reading the docs to understand what’s happening for each step along the way, but you’ll get there. As long as the window is showing up you’re doing great. Let’s do more.

Adding the controls

We need a label, text field, and button in this window. We’ll use NSTextField and NSButton to make that happen. Update your script with the following code and then run the app.

ObjC.import("Cocoa");

var styleMask = $.NSTitledWindowMask | $.NSClosableWindowMask | $.NSMiniaturizableWindowMask;
var windowHeight = 85;
var windowWidth = 600;
var ctrlsHeight = 80;
var minWidth = 400;
var minHeight = 340;
var window = $.NSWindow.alloc.initWithContentRectStyleMaskBackingDefer(
  $.NSMakeRect(0, 0, windowWidth, windowHeight),
  styleMask,
  $.NSBackingStoreBuffered,
  false
);

var textFieldLabel = $.NSTextField.alloc.initWithFrame($.NSMakeRect(25, (windowHeight - 40), 200, 24));
textFieldLabel.stringValue = "Image: (jpg, png, or gif)";
textFieldLabel.drawsBackground = false;
textFieldLabel.editable = false;
textFieldLabel.bezeled = false;
textFieldLabel.selectable = true;

var textField = $.NSTextField.alloc.initWithFrame($.NSMakeRect(25, (windowHeight - 60), 205, 24));
textField.editable = false;

var btn = $.NSButton.alloc.initWithFrame($.NSMakeRect(230, (windowHeight - 62), 150, 25));
btn.title = "Choose an Image...";
btn.bezelStyle = $.NSRoundedBezelStyle;
btn.buttonType = $.NSMomentaryLightButton;

window.contentView.addSubview(textFieldLabel);
window.contentView.addSubview(textField);
window.contentView.addSubview(btn);

window.center;
window.title = "Choose and Display Image";
window.makeKeyAndOrderFront(window);

If things worked you’ll now have a window with controls. You can’t type in the text field and the button doesn’t do anything, but hey, we’re getting somewhere.

Label, field, and button elements in the window

What did we do in that addition? textFieldLabel and textField are similar. They’re both instances of NSTextField. We make them in a similar way as we made the window. When you see initWithFrame and NSMakeRect, it’s a good sign that the script is creating a UI element. NSMakeRect does what it says. It makes a rectangle with the given position and dimensions; (x, y, width, height). That creates what’s called a struct in Objective-C. In JavaScript we’d refer to it as an object or hash or maybe dict. It’s a thing with keys and values.

After creating the text fields we set a handful of properties on each to get the results we’re looking for. Cocoa doesn’t have anything like the html label element. For that we make our own by disabling editing and background styling.

We’ll populate the text field programmatically, so we disable editing on it as well. If we didn’t need that, creating a standard text field is a one-liner.

For the button we use NSButton. Like the text fields, creating it requires drawing a rectangle. There are two properties that stand out; bezelStyle and buttonType. The values for both are constants. These properties control how the button will render and the style it will have. Check out the NSButton docs to see everything you can do with it. I also have an example app that shows the different button types and styles in action.

The last new thing we do is add the elements to the window with addSubview. When I first tried this I did window.addSubview(theView). That works for other standard views that you create with NSView, but not for instances of NSWindow. I’m not sure why that is, but for windows you need to add sub views to the contentView. The docs describe it as; “The highest accessible NSView object in the window’s view hierarchy”. Works for me.

Making the button do something

When clicking the Choose an Image button, we want a panel to open that shows files on our computer. Before we do that, let’s warm up by logging a message to the console when clicking the button.

In JavaScript you bind event listeners to elements to handle clicks. Objective-C doesn’t use quite that same concept. It has what’s called message passing. With it, you send a message containing a method name to an object. That object needs to have information about what to do when it receives a message containing the method name. That may not be a spot-on description, but it’s how I understand it.

The first thing we need to do is set the button target and action. The target is the object we want to send the action to. If this doesn’t make sense now, just keep going and it’ll get more clear when you see the code. Update the button setup part of the script with the following properties.

...
btn.target = appDelegate;
btn.action = "btnClickHandler";
...

appDelegate and btnClickHandler don’t exist yet. We need to make them. In the next addition to the script, order matters. I put comments in the following code to show where you should add the new stuff.

ObjC.import("Cocoa");

// New stuff
ObjC.registerSubclass({
  name: "AppDelegate",
  methods: {
    "btnClickHandler": {
      types: ["void", ["id"]],
      implementation: function (sender) {
        $.NSLog("Clicked!");
      }
    }
  }
});

var appDelegate = $.AppDelegate.alloc.init;
// end of new stuff

// Below here is in place already
var textFieldLabel = $.NSTextField.alloc.initWithFrame($.NSMakeRect(25, (windowHeight - 40), 200, 24));
textFieldLabel.stringValue = "Image: (jpg, png, or gif)";
...

Run the app, click the Choose Image button, and watch Console. Do you see “Clicked!” when you click the button? If so, that’s pretty darn cool right? If not, double check your script matches the code here and look for any errors in Console.

Subclassing

What’s this ObjC.registerSubclass all about? Subclassing is a way to create a new class that inherits from another Objective-C class. Side note; the likelihood I’m using incorrect terminology here is through the roof. Bear with me. registerSubclass takes a single argument; a JS object with the members of the new object. Members can be; name, superclass, protocols, properties, and methods. I’m not 100% sure that’s an exhaustive list, but that’s what’s in the release notes.

That’s all well and good, but what did we do there? Since we didn’t provide a superclass we inherited from NSObject. That’s the root class of most Objective-C classes. Setting a name let’s us reference the object later through the bridge with $ or ObjC.

$.AppDelegate.alloc.init; creates an instance of our AppDelegate class. Again, notice we don’t use parenthesis with the alloc or init methods since we’re not passing arguments.

Subclass methods

You create a method by giving it any string name. In this case “btnClickHandler”. Give it an object with types and implementation members. I haven’t found official docs for what the types array should contain. Through trial and error I found it goes like this:

["return type", ["arg 1 type", "arg 2 type",...]]

btnClickHandler won’t return anything so we set the return type to void. It takes one parameter, the sender object. In this case it will be the NSButton we named btn. We use the “id” type because represents any object.

The full list of types is in the release notes.

implementation is a normal function. Inside of it you write JavaScript. You have the same access to the $ bridge as you do outside of an object. You also have access to variables you create outside the function.

A quick note on using protocols

You can implement existing Cocoa protocols with sublcasses, but there’s a “gotcha”. I found out that if you use the protocols array your script will just stop with no errors. I wrote an example and explanation that you’ll want to have a look at if you’re doing that type of thing.

Choosing and displaying images

We’re ready to open the panel, choose an image, and display it. Update the btnClickHandler implementation function with following code:

...
implementation: function (sender) {
  var panel = $.NSOpenPanel.openPanel;
  panel.title = "Choose an Image";

  var allowedTypes = ["jpg", "png", "gif"];
  // NOTE: We bridge the JS array to an NSArray here.
  panel.allowedFileTypes = $(allowedTypes);

  if (panel.runModal == $.NSOKButton) {
    // NOTE: panel.URLs is an NSArray not a JS array
    var imagePath = panel.URLs.objectAtIndex(0).path;
    textField.stringValue = imagePath;

    var img = $.NSImage.alloc.initByReferencingFile(imagePath);
    var imgView = $.NSImageView.alloc.initWithFrame(
    $.NSMakeRect(0, windowHeight, img.size.width, img.size.height));

    window.setFrameDisplay(
      $.NSMakeRect(
        0, 0,
        (img.size.width > minWidth) ? img.size.width : minWidth,
        ((img.size.height > minHeight) ? img.size.height : minHeight) + ctrlsHeight
      ),
      true
    );

    imgView.setImage(img);
    window.contentView.addSubview(imgView);
    window.center;
  }
}

The first thing we do is create an instance of an NSOpenPanel. If you’ve ever selected a file to open, or a location to save a file you’ve seen panels in action.

We only want the app to open images. Setting allowedFileTypes lets us specify the allowed types for the panel. It expects an NSArray. We create a JS array with allowedTypes, but we need to convert that to an NSArray. We make that conversion with $(allowedTypes). This is another use of the bridge. You use it in this way to bridge a JS value to Objective-C. To bridge an Objective-C value to JavaScript you’d use $(ObjCThing).js.

We open the panel with panel.runModal. That pauses code execution. When you click Cancel or Open, the panel will return a value. If you clicked Open, the contstant value of $.NSOKButton is returned.

The other note about panel.URLs is important. In JS, we access the first value of an array with array[0]. Because URLs is an NSArray we can’t use bracket notation. Instead we use the objectAtIndex method. They both give the same results.

Once we have the URL of the image we can create a new NSImage. Since it’s common to create an image from a file URL, there’s a handy method doing it:

initByReferencingFile

We create an NSImageView using the same process we did for creating other UI elements. imgView handles displaying the image.

We want to change the width and height of our window to match the width and height of the image. We also want to make sure we don’t go below a minimum width or height. We use setFrameDisplay to change the size of the window.

We wrap things up by setting the image view image and adding it to the window. Since its width and height have updated we re-center the window.

And that’s our little app. Kick it around, open a bunch of images. And yes, animated gifs will display and play so be sure to try some.

Tidbits

So far, we’ve been running the app from Script Editor with opt + cmd + d. You run the actual app by double clicking its icon like any other application.

You can update the app icon by replacing /Contents/Resources/applet.icns. To access the app resources, right click on the app icon and choose “Show Package Contents”.

Why I’m excited

I’m excited about this because I think it has a lot of potential. Here’s how I’ve been thinking about it. When Yosemite is available, anyone that installs the OS will be able to sit down and write a native application. They will be able to do so using one of the most ubiquitous programming languages that exists. They won’t need to download or install anything extra. You don’t even have to install or open Xcode if you don’t want to. That removes a massive barrier to entry. It’s incredible.

I know there’s a whole lot more to building OS X apps than just being able whip up a quick script. I don’t have any illusions that JavaScript will become the defacto way to build for the Mac. I do think this will allow developers to build small apps that make development easier for themselves and for other people. Have a team member that’s not so comfortable using the command line? Build a quick GUI for them. Need a quick, visual way to create and update a large configuration file? Make a little app for it.

Other languages have these capabilities. Python and Ruby have access to the same native APIs and people make apps using those languages. Something just feels different about being able to use JavaScript. It feels subversive. It feels like some of the DIY of the Web knocking on the door of the Desktop. Apple left the door unlocked, I’m inviting myself in.

Thanks for reading