Is interop with clojure.lang.RT an anti-pattern in clojure? / consider adding iter to clojure.core

Question

Is interop with clojure.lang.RT an anti-pattern in clojure? / consider adding iter to clojure.core

asked Mar 8, 2021 in Clojure by borkdude
edited Mar 8, 2021 by borkdude

I've seen several occurrences of code using clojure.lang.RT/iter to retrieve an iterator for a Clojure collection.

(iterator-seq (clojure.lang.RT/iter [1 2 3]))

See e.g.:

https://github.com/noprompt/meander/commit/d2310daaa485afb4e15ceda72aa57f97ea90f284

and:

https://github.com/wilkerlucio/cljc-misc/blob/bb3c8016cace18db5caa5fe0aa5df7a507935f8d/src/main/com/wsscode/misc/coll.cljc#L262

For compatibility of babashka with Clojure I would like to know if I should expose clojure.lang.RT. I'm not entirely convinced that it should be, as clojure.lang.RT is probably an implementation detail of Clojure.

Instead, maybe Clojure can expose the RT/iter method as a clojure.core function?

I "grasped" my local .m2 for usages of clojure.lang.RT and excluded clojure itself:

[clojure.lang.RT/loadClassForName 15]
[clojure.lang.RT/iter 5]
[clojure.lang.RT/loadLibrary 4]
[clojure.lang.RT/assoc 4]
[clojure.lang.RT/classForName 3]
[clojure.lang.RT/load 2]

Note that ClojureScript has an iter function:

$ plk
ClojureScript 1.10.597
cljs.user=> (iter [1 2 3])
#object[cljs.core.RangedIterator]
cljs.user=>

$ clj
Clojure 1.10.1
user=> (iter [1 2 3])
Syntax error compiling at (REPL:1:1).
Unable to resolve symbol: iter in this context

2 Answers

Wilker Lúcio · Answer 1 · 2021-03-08T14:20:12+0000

commented Mar 8, 2021 by alexmiller

commented Mar 8, 2021 by Wilker Lúcio

commented Mar 8, 2021 by alexmiller

alexmiller · Answer 2 · 2021-03-08T14:24:33+0000

commented Mar 8, 2021 by borkdude

commented Mar 8, 2021 by alexmiller

commented Mar 8, 2021 by Wilker Lúcio

The main case I need iterators is to create a custom map type.

The exact detail is how to implement a custom `keys` for the type.

The way Clojure does keys is by first triggering `seq` on the type, and also send the type as you can see here: https://github.com/clojure/clojure/blob/57f7c6ea164271d92cec5a12d22043c7029e3de2/src/jvm/clojure/lang/APersistentMap.java#L147-L154

This implementation requires the type to also implement `Iterable`, which must return an `iterator`.

This is where I mostly need the `iter` function that is only available in `RT`, because there is an underlying structure and I'm re-using its iterator.

The current solution to make it cross platform is:

```
(defn iterator
  "CLJC utility to get an iterator from the collection."
  [coll]
  #?(:clj
     (if (instance? java.lang.Iterable coll)
       (.iterator ^java.lang.Iterable coll)
       (.iterator (seq coll)))

     :cljs
     (iter coll)))
```

So although I don't really want to use an iterator, this case forces me to.

commented Mar 8, 2021 by alexmiller

commented Mar 8, 2021 by Wilker Lúcio

commented Mar 8, 2021 by alexmiller

commented Mar 9, 2021 by Joel Holdbrooks

> they are both about building an iterator function neither of which is ever called in those projects

Alex, I want to clarify that this is incorrect. Meander does, in fact, use the `iter` function via generated macro code to implement some semantics a bit more easily (simpler code generation) in some situations. While I've since realized an opportunity to implement this without `iter`, at the time I didn't have that solution and `iter` solved my problem.

You acknowledge these cases exist and this is the reason some Clojure internals haven't been "locked away". You acknowledge the provided JVM implementation of `iter` given by Wilker is "perfectly fine". Yet you don't think it should be part of the public API or a good idea but you don't give a clear reason why these are your thoughts.

> I do not think `iterator` is a function that should be widely used or promoted

I agree. Many functions in core exist which are not widely used nor promoted but nevertheless are there for their utility. In other words, how promotable or how wide the use case is can't be the reason for excluding `iter`.

> Iterators, in general, are very un-clojurey.

This is an exclusive, essentialist position. Its like, your opinion, man. :^)

Clojure includes numerous opportunities to use "stateful and generally not concurrency friendly" things in the core library and, to your point, makes those things accessible on purpose. So whatever "clojurey" includes, from where I'm sitting, it seems include those "un-clojurey" things.

Now, let me ask, what is the arbitrary criteria a function must meet before "should" be considered?

commented Mar 9, 2021 by alexmiller

Just stepping way back here, deciding what goes in the core api / language is a matter of collecting problems people are encountering and sifting all those to find patterns and deciding based on frequency, severity, level of workarounds, etc which problems are the most important to address.

The best way to provide input to that process is describe as clearly as possible the problem you have. When you ran into this in Meander, why did you want an iterator? What was the actual problem at hand? What other things did you consider? Why weren't they sufficient?

Wilker's description of needing to implement the iterator method of his own collection type with a backing coll is a useful example. Adding `iter` to core is one possible solution. Another is to write your own function using existing Java apis, as he did. Another might be to encapsulate the general need to build custom collection types into something inside Clojure, or in a library. etc. I do note that the scope of need here is not common (most people do not create custom collection types), he solved his problem without having an `iter` function using available Clojure/Java APIs, and I have no other examples of someone with the same problem (but maybe they exist).

My general bias is against iterators. iterators are dangerous and imperative in ways seqs and reducibles are not and it's no accident that it's not part of the Clojure API. This is not just my opinion, it's Rich's opinion which I am transmitting. He's written/talked about this in lots of places:

* https://clojure.org/about/functional_programming#_extensible_abstractions
* https://clojure.org/reference/sequences
* https://clojure.org/news/2012/05/08/reducers
* https://github.com/matthiasn/talk-transcripts/blob/master/Hickey_Rich/ClojureIntroForLispProgrammers.md

Also, we had extensive discussions about this during the design and implementation of transducers, which lean heavily on iterators in the implementation but encapsulate their use for safety.

Even given all that, I don't think it's off the table, but the level of need does not seem high enough to overcome that bias to me so far. If you want to change my mind, bring me problems.

commented Mar 10, 2021 by Joel Holdbrooks

Everything below is bracketed in "I have since derived a newer solution which does not rely on `iter` but at the time I came to `iter` to solve my problem it did what I needed it to."

> When you ran into this in Meander, why did you want an iterator?

Being pedantic, I did not *want* an iterator initially but it ended up "solving" my problem which I will detail below.

> What was the actual problem at hand?

There few intersecting pieces here. Meander has a concept of a pattern substitution which you can think of as sort of the dual to pattern matching. Matching applies a pattern to an object and returns a set of bindings, substitution applies bindings to a pattern and returns an object. The compilation of substitution patterns — in particular repeated patterns containing a type of variable known as a "memory variable" — is where this problem arises.

A memory variable is a variable that is bound to a fifo. Each occurrence of a memory variable in a pattern "disperses" or the first value from the fifo and updates the binding of the variable to the remaining elements in the fifo. This continues until the fifo is empty and is said to be "exhausted". When memory variables appear in a repeated pattern (think Kleene star), then the repeated pattern produces values while at least one of the memory variables is not exhausted. When repeated patterns are compiled to Clojure code, there is a check to see if the pattern contains memory variables. If so, it then compiles iterators for each of the memory variables each of which can be tested for exhaustion via `hasNext` and values can be dispersed with, of course, `next`. While this is not an ideal solution, the semantics I wanted where easy to implement this way.

> What other things did you consider?

Initially, I tried a pure solution but this lead to a lot complexity and headache. This has more to do with the poor quality of the design which relies on `iter` though and less with a pure approach.

I also tried `volatile!` but this produced incorrect results in some cases.

Its been a while but I think considered using an array too but `iter` was there and did what I wanted it to so I didn't bother with it.

> Why weren't they sufficient?

Because either there were complexities that as a maintainer I didn't wish to occupy myself with, or because they resulted in an incorrect implementation of the semantics.

commented Mar 10, 2021 by Joel Holdbrooks

> I don't think it's a good idea that CLJS has `iter`.

This is the heart of the issue. CLJS has always been treated sort of like a dialect of Clojure resulting in many situations like this one. While I understand there is some bias against formally specifying parts of the language, having at least some minimal definition of primitives and what symbols are resolvable by default in an empty namespace could serve to mitigate these issues for future implementations. Personally, I think the `ns` macro is part of the problem because it is pervasive (`in-ns` is more primitive but rarely used; CLJS requires it), and it refers core, the contents of which are arbitrarily based on what the implementation thinks should be there; issues like this one are a natural consequence.

If a future design of the language included a minimal definition of primitives such that clean separation of platform specific symbols could be achieved easily, these issues of disparity between CLJ and CLJS or other implementation would likely vanish.

FWIW I'm also speaking from experience. Over the past 5 years I have *partially* implemented a Clojure interpreter/compiler for Ruby 2.X.X, a small step symbolic interpreter for Clojure, and a partial evaluator for Clojure. In each case I was never able to complete the projects for lack of a clear language definition both in syntax and semantics. For example, there are special forms like `case*` which are not documented and require inspection of the compiler to determine their semantics.

Anyway, I make no demands here. I only wish to share my observations. I think there is a possible future for Clojure where issues like this do not exist. As someone who has programmed in the language professionally and as a hobby for nearly a decade, I think some specification of the language would pave the way to that future.

commented Mar 14, 2021 by Andy Fingerhut

Is interop with clojure.lang.RT an anti-pattern in clojure? / consider adding iter to clojure.core

Please log in or register to add a comment.

Please log in or register to answer this question.

2 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Categories

Is interop with clojure.lang.RT an anti-pattern in clojure? / consider adding iter to clojure.core

Please log in or register to add a comment.

Please log in or register to answer this question.

2 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Related questions

Categories