Clustering involves more network connections and thus more network latencies. Data has to be de/serialized (which involves parsing etc.). Shards etc. have to be managed in a centralized manner.
Depending on your query, (i.e. a sub query that leans on the sorted result of its bearer) parts of the query have to be distributed across the cluster with several round trips involving even more communication.
Clustering is intended to give you rather a higher throughput and access to more computing resources and not the low latency a single server environment can provide.
As long as a single machine can scale to your workload, clustering simply isn't the proper solution. This will change with our upcoming 3.0 version, in which the new synchronous replication gives you fault tolerance and high availability in addition to scalability. Currently you can distribute query load to several machines using replication.
Read more about ArangoDB Cluster performance in Max Blog article which scales to a big environment whilst keeping the latency reasonably low.