[swift-evolution] Inconsistencies related to prepositions

Jon Gilbert via swift-evolution Wed, 02 Aug 2017 08:44:48 -0700

When Swift 3 launched, it introduced a new concept of placing the preposition 
inside the parentheses. (See discussion here: 
https://lists.swift.org/pipermail/swift-evolution/Week-of-Mon-20160208/009523.html).


I'm fine with that, however this change got implemented in an inconsistent 
manner, making Swift and its APIs more vague, thereby decreasing code clarity. 

I was hoping to see these inconsistencies get cleared up with Swift 4, however 
they were not. I realized that perhaps I could have spoken up and taken more of 
an active role in Swift Evolution. So... better late than never. Here are my 
thoughts, towards consistency and disambiguation in Swift.

Disclaimer: I’m sure the last thing anyone wants are more changes to the APIs. 
However, I also don't think these inconsistencies should be left in place 
forever. I have tried to go back and read through as much of the relevant 
discussions on list as I could, and it seems like 

Please let me some give examples below. After that is some discussion and a 
proposal.

EXAMPLES

Take for example the preposition "with." A preposition is meaningless without a 
subject and an object. Look at the following three sentences:

 (A) "A boy with the dog Molly walked across the street."

 (B) "With the dog Molly walked across the street."

 (C) "A boy with Molly walked across the street."

Sentence (A) is longest, but it makes perfect sense.

Sentence (B) is nonsensical and grammatically incorrect because while the word 
"with" takes two arguments, only one is present. If there was a comma after 
"dog," then it would make sense; "With the dog, Molly walked across the street" 
is fine. But if we were to assume that Molly is not the dog, in this case, we'd 
actually be wrong.

Sentence (C), while grammatically correct, leaves it unclear whether Molly is a 
dog, a girl, or… something else.  

The reason for this is, whenever a preposition is used in English, it almost 
always takes a dyadic form, relating a subject to the preposition's object. The 
two most common dyadic formats are:

<subject> [<preposition> <object of preposition>]
<The boy> [<with> <the dog>] crossed the street. 

[<preposition> < object of preposition>] <subject>
[<In> <space>], <no one> can hear you scream.
[<On> <the Moon>] are <many craters>.

Now, in Objective C through Swift 1 and 2, prepositions' dyadic nature were 
generally respected in method signatures. However, Swift 3's migration of the 
preposition inside the parentheses also seems to have been accompanied by the 
stripping away of either the subject, the prepositional object, or 
both—according to no discernible pattern. For example: 

(1) CloudKit:

old: myCKDatabase.fetchRecordWithID(recordID)
new: myCKDatabase.fetch(withRecordID: recordID)
(subject "Record" got removed)

(2) String:

old: myString.capitalizedStringWithLocale(_: myLocale)
new: myString.capitalized(with: myLocale)
(subject "String" and prep. object "Locale" both got removed)

(3) Dictionary:

old: myDict.removeAtIndex(myIndex)
new: myDict.remove(at: myIndex)
(subject "element" already missing from both; prep. object "Index" got removed)

(4) Date:

old: myDate.timeIntervalSinceDate(myDate)
new: myDate.timeIntervalSince(date: myDate)
(subject "timeInterval" and object "Date" both still present; but oddly, 
preposition "since" got left outside of the parentheses)

(5) Array:

            old: myArray.forEach({ thing in code})
new: myArray.forEach() { thing in //code }
            (preposition “for” is outside of the parentheses)

DISCUSSION OF EXAMPLES

These four changes are inconsistent with each other regarding whether the 
subject, prepositional object, or both were removed, and we don't even have 
consistency as to whether the preposition got moved inside the parentheses (see 
A below). 

As well, these changes generally removed the dyadic arguments to the 
preposition, which removes cues necessary to disambiguate the prepositional 
relationship, decreasing code readability (see B below).

(A) Inconsistency

The inconsistency between the examples is shown in the bold text of each 
example, but lets go over why this matters. It matters because any language is 
easier to learn the more consistently it sticks to its own rules. Autocomplete 
is our great savior, but still, if we were being consistent, then the new 
method signatures would have been:

(1) myCKDatabase.fetchRecord(withRecordID:)
(2) myString.stringCapitalized(withLocale:)
(3) myDictionary.elementRemoved(atIndex:)
(4) myDate.timeInterval(sinceDate:)
(5) myArray.each(inClosure: )   

Side note: for plain English readability, we might prefer 
elementRemoved(fromIndex:) and stringCapitlized(accordingToLocale:). 

Although I do understand removing "string" from the latter was to reduce 
redundancy in function/method declarations, we only make one declaration, yet 
we make many calls. So increasing ambiguity in calls does not seem like a good 
trade-off for decreased boilerplate in declarations. More often than not it's 
calls that we're reading, not the declarations—unless of course the call was 
ambiguous and we had to read the declaration to make sense out of it. So 
perhaps we might question if increased ambiguity is an overall good thing. 

Side note: example (5), .forEach, does seem like a very exceptional case, and 
possibly a sacred cow. See the Proposal section for further discussion of this.

(B) Increased Ambiguity

In all of these changes, one of or both parts of the dyadic arguments of the 
preposition have been excised from the method signatures. This increases 
ambiguity in Swift 3/4 vs. Objective C and Swift 2, especially in closures, as 
I will explain below.

In example (1), the old method argument makes grammatical sense because "record 
with RecordID" follows the format, <subject> [<preposition> <object of 
preposition>], just like "boy with dog tag" or "cow with black spots." This 
improves code readability, because when you read this, you know that this 
function will give you a record matching a particular recordID. However in 
Swift 3, we have the equivalent of, "with a recordID"—i.e. is *implied* that 
the thing being fetched is a record. 

This isn't a problem you're reading the method signature in the header, and 
it's not a problem when you're writing the code, because you'll get warnings 
and errors if you try to assign to the wrong type.

However this removal of explicit contextual cues from the method signature 
harms readability, since now, the compiler will let people write code like:

{ return $0.fetch(withRecordID:$1) }

Clearly, the onus is now on the developer not to use cryptic, short variable 
names or NO variable names. However, spend much time on GitHub or in CocoaPods 
and you will see an increasing number of codebases where that's exactly what 
they do, especially in closures.

Another problem is that the compiler doesn't care if you write:

{ ambiguousName in
let myRecordID = ambiguousName.fetch(withRecordID:myID) 
return myRecordID }

This is highly problematic because someone reading this code will have no 
reason to expect the type of "myRecordID" not to be CKRecordID. (In fact, it's 
CKRecord.) There is also no way to clarify what ambiguousName is since closures 
don't have argument labels at all... but that's another problem altogether.

Turning now to example (2), "myString.capitalized(with:myLocale)" sacrifices 
BOTH the subject and the prepositional object. We have now lost any enforced 
contextual cues, fully orphaning "with", allowing and even encouraging a final 
closure argument like: 

{ return $0.capitalized(with: .current)) }

What is $0? Reading this, and being familiar with Swift, you will likely assume 
it's a string, but will you remember that .current is a Locale? The compiler 
knows, but why was it a good idea to strip out that information? 

We also have examples like: 

{ return $0.draw(with:$1) }

What is $0? What is $1? This is a real Apple API, BTW.

We could also have:

{array, key in 
let number = array.remove(at:key)
return number }

This will compile and run even though number will be a tuple key-value pair, 
array will be a dict, and key will be an index type! This may seem like a 
ridiculous example, but I have literally seen things like this.

DISCUSSION

Making computer code more like natural language and thus more approachable to 
non-computer-science types was clearly one of the primary goals of Apple's 
method names in Objective C. 

I've been a fan of the general direction in which Swift has taken things, but 
I'm starting to see code that is quite unreadable as a result of the stripping 
away of useful information from code, as if more white space will improve our 
ability to make sense out of our arcane invocations.

The point of code readability is for humans to be able to read and understand 
code with minimal extra effort. Explicit information is important because 
humans would like to read code without having to dig through five different 
headers and read lots of documentation files. This directly increases costs, 
because when a new developer comes onto a project, the longer it takes them to 
understand the codebase, the more it will cost the company.

I don't believe the intent with these changes was to increase ambiguity; at 
WWDC '16 we were told it was to increase clarity. I'm just not sure that's the 
consistent effect we actually received. 

What we got instead is more times we will need to CMD-click things in XCode to 
get a clue. That's especially annoying because you're at XCode's mercy as to 
whether it actually wants to jump to a given declaration or not. This seems 
pretty hit or miss in my experience, even within a basic playground page, and 
is especially troublesome with jumping to declarations within CocoaPods. 
Selecting some code to get documentation in the side panel seems equally 
unreliable. 

Perhaps CMD-click problems have to do with a project setting, but is there 
really a time when you would ever NOT want to have these capabilities? Why is 
it tied into a setting? Further, when you're reading through a repo on GitHub 
itself, you don't have CMD-click capability; you want the code to be as clear 
as possible, without lots of ambiguity.

From Ambiguous to Cryptic

Orphaning method signatures by stripping useful return type and argument type 
information wouldn't be so bad if variables were all named descriptively, but 
that is a strangely optimistic hope for a language that's as paranoid about 
safety that it was specifically designed to prevent many categories of common 
mistakes.

The increased ambiguity in Swift from the removal of prepositional components 
has been further compounded by other forms of ambiguity that are increasingly 
everywhere in Swift, and which developers have taken copious advantage of to 
the point of abuse. Examples are:

- arguments labelled as "_" reduce code clarity
- calls to functions that take closures as the final argument can leave the 
final argument label out, decreasing readability
- closures have lack explicit type information, leaving it up to variable names 
to let us know what's going on; however, the use $# syntax and/or cryptically 
named variables often leaves it totally ambiguous
- generic types in method signatures using a single letter give no clue as to 
the type's purpose
- function types can't have argument labels, which might give a sense of 
context or informative cues, even if they added boilerplate 
- inferred nested type parents (like the ability to do .current instead of 
being forced to do Location.current)

Why is removing context and clue good?

Is it possible that this was an overreaction to the perceived over-verbosity of 
Objective C? I'm honestly curious. I will try to read back through the 
archives, but perhaps someone can give me a good summary.

In the old Objective C style, method names could be quite long, but I never 
felt a lack of context that made code unclear. The method signatures were 
basically like sentences without spaces, and they seemed to follow pretty 
reliable rules. So reading them actually made grammatical sense, improving code 
readability. I could always tell what a call was doing.

While I won't deny that Objective C went overboard sometimes in verbosity, and 
while few would dispute that we should clean up its style, could it be we are 
going too far making Swift encourage code that's ambiguous to everyone except 
the compiler? Could it be a mistake for Swift to err so far on the side of 
inferring everything and stripping away contextual information?

A first-time reader of some unseen code is not going to be able to infer things 
like the original programmer or the compiler can. This increases the 
possibility of bugs (and exploits) sneaking in "under the radar."

PROPOSAL

To add more clarity back to Swift, I'd propose the following rules be applied 
consistently across the language:

• Every time a preposition shows up in a function/method signature, there is an 
explicit subject and object of the preposition, except in the following 
exceptions.

• The subject of a preposition can be left out if the function/method is 
mutating AND doesn't return a value, such as Array's .append(contentsOf:) 
method. This method would become: .append(contentsOfCollection:) so that it's 
clear what we're doing here, even within a closure.

• No preposition should be used when it would add no extra meaning between 
what’s inside and outside the parentheses. For example, userFont(ofSize 
fontSize:) should just be userFont(size: fontSize), because “of” is completely 
superfluous here. AVFragmentedMovie’s track(withTrackID:) could just as easily 
be track(forTrackID:) or track(TrackID:) which tells us that the preposition is 
superfluous and so why not just have it be track(id:). 

• If the preposition goes with a verb in the method signature, the verb should 
be the last thing before the opening parenthesis (e.g. String's .capitalized 
function would now look like name.stringCapitalized(accordingToLocale: 
.current)... no pun intended).

• In the case of .forEach, since it’s almost identical in function to .map, 
perhaps a solution could be to eliminate .forEach entirely, and simply say that 
anytime .map’s result is unused (or assigned to _), then the compiler will 
automatically use .forEach functionality to optimize performance. For example 
if in Swift 5 you did _ = myArray.map() { print($0) }, this would be the same 
as doing myArray.forEach() { print($0) } currently. As well, _ = could be 
inferred. 

Note: part of the problem is obviously that parameter names are never used in 
function calls when there is an argument label, but often, the argument label 
is now just a preposition, while its prepositional object is the (now hidden) 
parameter name. Perhaps we can think of a novel solution to this?

I'm interested to know what people think about this. Perhaps there are other 
ideas or ways to add clarity back in. Thanks for reading this long message. 

Jon

_______________________________________________
swift-evolution mailing list
[email protected]
https://lists.swift.org/mailman/listinfo/swift-evolution

[swift-evolution] Inconsistencies related to prepositions

Reply via email to