Can I make this routine for scoring a graph bisection more efficient?

Question

Can I make this routine for scoring a graph bisection more efficient?

asked Sep 25, 2022 in Clojure by Rik Belew

My code is spending most of its time scoring bisections: determining how many edges of a graph cross from one set of nodes to the other.

Assume bisect is a set of half of a graph's nodes (ints), and edges is a list of (directed) edges [ [n1 n2] ...] where n1,n2 are also nodes.

(defn tstBisectScore
  "number of edges crossing bisect"
  ([bisect edges]
   (tstBisectScore bisect 0 edges))

  ([bisect nx edge2check]
   (if (empty? edge2check)
     nx

     (let [[n1 n2] (first edge2check)
           inb1 (contains? bisect n1)
           inb2 (contains? bisect n2)]
       (if (or (and inb1 inb2)
               (and (not inb1) (not inb2)))
         (recur bisect nx (rest edge2check))
         (recur bisect (inc nx) (rest edge2check))))

     )))

The only clues I have via sampling the execution of this code (using VisualVM) shows most of the time spent in clojure.core$empty_QMARK_, and most of the rest in clojure.core$contains_QMARK_. (first and rest take only a small fraction of the time.)

Any suggestions as to how I could tighten the code?

1 Answer

Eugene Pakhomov · Answer 1 · 2022-09-25T21:27:21+0000

commented Sep 26, 2022 by Rik Belew
edited Sep 26, 2022 by Rik Belew

thanks very much @Eugene, these seem like excellent suggestions. i implemented both your peek/pop and IPersistentSet ideas:

   (defn tstBisectScore2
     ([bisect edges]
      (tstBisectScore2 bisect 0 (vec edges)))

     ([bisect nx edge2check]
      (if (zero? (count edge2check))
        nx

        (let [[n1 n2] (peek edge2check)
              inb1 (.contains ^clojure.lang.IPersistentSet bisect n1)
              inb2 (.contains ^clojure.lang.IPersistentSet bisect n2)]
          (if (or (and inb1 inb2)
                  (and (not inb1) (not inb2)))
            (recur bisect nx       (pop edge2check))
            (recur bisect (inc nx) (pop edge2check))))

        )))

but timing old vs. new shows only slight (6%) improvement?

   (time (dotimes [i 100000]
           (hpclj.test/tstBisectScore1 rp g6fe)))
   "Elapsed time: 6103.62275 msecs"

   (time (dotimes [i 100000]
           (hpclj.test/tstBisectScore2 rp g6fe)))
   "Elapsed time: 5732.2645 msecs"

i know timing clojure has fine-points (startup?), so maybe this simple experiment is insufficient? thanks again in any case.

commented Sep 26, 2022 by Eugene Pakhomov

commented Sep 26, 2022 by Rik Belew

curiouser and curiouser! criterium makes your ideas seem slightly (3%) SLOWER:

   (criterium/quick-bench (tstBisectScore1 rp g6fe))

   Evaluation count : 11148 in 6 samples of 1858 calls.
                Execution time mean : 54.087476 µs
       Execution time std-deviation : 387.196475 ns
      Execution time lower quantile : 53.651538 µs ( 2.5%)
      Execution time upper quantile : 54.485682 µs (97.5%)
                      Overhead used : 1.942216 ns

   (criterium/quick-bench (tstBisectScore2 rp g6fe))

   Evaluation count : 11262 in 6 samples of 1877 calls.
                Execution time mean : 55.667037 µs
       Execution time std-deviation : 1.035787 µs
      Execution time lower quantile : 54.642948 µs ( 2.5%)
      Execution time upper quantile : 57.269671 µs (97.5%)
                      Overhead used : 1.942216 ns

commented Sep 26, 2022 by Eugene Pakhomov

commented Sep 26, 2022 by Rik Belew

commented Sep 29, 2022 by alexmiller

Can I make this routine for scoring a graph bisection more efficient?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Categories

Can I make this routine for scoring a graph bisection more efficient?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Related questions

Categories